Showing papers by "Yap-Peng Tan" published in 2023

Open Access

Journal Article

Backdoor Attacks Against Deep Image Compression via Adaptive Frequency Trigger

Yi Yu, Yufei Wang, Wenhan Yang, Shijian Lu, Yap-Peng Tan, Alex C. Kot

28 Feb 2023 - arXiv.org

TL;DR: This paper proposes a frequency-based trigger injection model that adds triggers in the discrete cosine transform (DCT) domain to mount backdoor attacks on learned image compression models, targeting compression quality in terms of bit-rate and reconstruction quality.

Abstract: Recent deep-learning-based compression methods have achieved superior performance compared with traditional approaches. However, deep learning models have proven to be vulnerable to backdoor attacks, where some specific trigger patterns added to the input can lead to malicious behavior of the models. In this paper, we present a novel backdoor attack with multiple triggers against learned image compression models. Motivated by the widely used discrete cosine transform (DCT) in existing compression systems and standards, we propose a frequency-based trigger injection model that adds triggers in the DCT domain. In particular, we design several attack objectives for various attacking scenarios, including: 1) attacking compression quality in terms of bit-rate and reconstruction quality; 2) attacking task-driven measures, such as down-stream face recognition and semantic segmentation. Moreover, a novel simple dynamic loss is designed to balance the influence of different loss terms adaptively, which helps achieve more efficient training. Extensive experiments show that with our trained trigger injection models and simple modification of encoder parameters (of the compression model), the proposed attack can successfully inject several backdoors with corresponding triggers in a single image compression model.

2 citations

Proceedings Article

Temporal Coherent Test-Time Optimization for Robust Video Classification

Chen Yi, Siyuan Yang, Yufei Wang, Haoliang Li, Yap-Peng Tan, Alex C. Kot

28 Feb 2023

TL;DR: This paper proposes the Temporal Coherent Test-time Optimization framework (TeCo), which uses spatio-temporal information in test-time optimization for robust video classification and achieves significant improvements in corruption robustness across Mini Kinetics-C and Mini SSV2-C.

Abstract: Deep neural networks are likely to fail when the test data is corrupted in real-world deployment (e.g., by blur or weather). Test-time optimization, which adapts models to generalize to corrupted data during testing, has been shown to be effective in the image domain. However, techniques for improving the corruption robustness of video classification remain few. In this work, we propose a Temporal Coherent Test-time Optimization framework (TeCo) to utilize spatio-temporal information in test-time optimization for robust video classification. To exploit information in video with self-supervised learning, TeCo minimizes the entropy of the prediction based on the global content from video clips. Meanwhile, it also feeds local content to regularize the temporal coherence at the feature level. TeCo retains the generalization ability of various video classification models and achieves significant improvements in corruption robustness across Mini Kinetics-C and Mini SSV2-C. Furthermore, TeCo sets a new baseline in video classification corruption robustness via test-time optimization.

1 citation

Journal Article

Crowd counting from single images using recursive multi-pathway zooming and foreground enhancement

Junjie Ma, Yaping Dai, Zhiyang Jia, Fuchun Sun, Yap-Peng Tan, Jun Lu

01 Apr 2023 - Pattern Recognition

TL;DR: This paper proposes a multi-pathway zoom network (MZNet) that recursively optimizes multi-scale features using multiple zooming pathways and progressively enhances foreground information to improve crowd counting performance.

Journal Article

POAR: Towards Open-World Pedestrian Attribute Recognition

Yue Zhang, Suchen Wang, Shichao Kan, Zhenyu Weng, Yigang Cen, Yap-Peng Tan

26 Mar 2023 - arXiv.org

TL;DR: This paper formulates pedestrian open-attribute recognition as an image-text search problem and designs a Transformer-based image encoder with a masking strategy.

Abstract: Pedestrian attribute recognition (PAR) aims to predict the attributes of a target pedestrian in a surveillance system. Existing methods address the PAR problem by training a multi-label classifier with predefined attribute classes. However, it is impossible to exhaust all pedestrian attributes in the real world. To tackle this problem, we develop a novel pedestrian open-attribute recognition (POAR) framework. Our key idea is to formulate the POAR problem as an image-text search problem. We design a Transformer-based image encoder with a masking strategy. A set of attribute tokens is introduced to focus on specific pedestrian parts (e.g., head, upper body, lower body, feet) and encode the corresponding attributes into visual embeddings. Each attribute category is described as a natural language sentence and encoded by the text encoder. Then, we compute the similarity between the visual and text embeddings of attributes to find the best attribute descriptions for the input images. Unlike existing methods that learn a specific classifier for each attribute category, we model the pedestrian at the part level and explore the search method to handle unseen attributes. Finally, a many-to-many contrastive (MTMC) loss with masked tokens is proposed to train the network, since a pedestrian image can comprise multiple attributes. Extensive experiments have been conducted on benchmark PAR datasets with an open-attribute setting. The results verify the effectiveness of the proposed POAR method, which can form a strong baseline for the POAR task.
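
The image-text search step described in this abstract, matching each part-level visual embedding to the most similar attribute sentence, can be sketched as follows. This is a toy illustration rather than the POAR implementation: cosine similarity is an assumed choice (the abstract only says "similarity"), and the function names, part names, sentences, and two-dimensional vectors are hypothetical stand-ins for real encoder outputs.

```python
import math


def cosine_similarity(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def best_descriptions(visual_embeddings, text_embeddings):
    """For each body part, return the attribute sentence whose text
    embedding is most similar to that part's visual embedding.

    visual_embeddings: dict mapping part name -> vector
    text_embeddings:   dict mapping sentence  -> vector
    """
    return {
        part: max(
            text_embeddings,
            key=lambda s: cosine_similarity(vis, text_embeddings[s]),
        )
        for part, vis in visual_embeddings.items()
    }


# Hypothetical toy embeddings; real ones would come from trained encoders.
visual = {"head": [1.0, 0.0], "feet": [0.0, 1.0]}
text = {"wearing a hat": [0.9, 0.1], "wearing boots": [0.1, 0.9]}
matches = best_descriptions(visual, text)
```

Because the candidate sentences are just entries in a dictionary, an attribute unseen during training can be added at query time by encoding its description, which is the property the open-attribute setting relies on.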
