Wenzhao Xiang

Institute of Computing Technology

Author Statistics

Papers

Citation

H-Index

i-10 index

Research Trends

Author Order

Document Type

Co-Authors

Hang Su

Tsinghua University

Chang Liu

East China Normal University

Yinpeng Dong

Tsinghua University

Shibao Zheng

Shanghai Jiao Tong University

Shibao Zheng

Shanghai Jiao Tong University

Hui Xue

Tianjin Medical University General Hospital

Hang Su

Tsinghua University

Jun Zhu

Tsinghua University

Yandong Guo

Bohai University

Yuan He

Alibaba Group (United States)

Cooperative Institutions

Tsinghua University

Chinese Academy of Sciences

Shanghai Jiao Tong University

Alibaba Group (China)

Beijing Institute of Technology

Huazhong University of Science and Technology

University of Science and Technology of China

University of Electronic Science and Technology of China

PLA Army Engineering University

Alibaba Group (United States)

Author Statistics

Papers

Citation

H-Index

i-10 index

Research Field

Improving Model Generalization by On-Manifold Adversarial Augmentation in the Frequency Domain

Chang Liu Wenzhao Xiang Yuan He Hui Xue Shibao Zheng

Download This Paper Open PDF in Browser Add Paper to My Library Share: Permalink Using these links will ensure access to this page indefinitely Copy URL Copy DOI

Manifold (fluid mechanics)

10.2139/ssrn.4768680

Cite

Citations (0)

RobustPrompt: Learning to defend against adversarial attacks with adaptive visual prompts

Pattern Recognition Letters (2025)

Chang Liu Wenzhao Xiang Yinpeng Dong Xingxing Zhang Liyuan Wang

10.1016/j.patrec.2025.02.015

Cite

Citations (0)

Improving the Robustness of Adversarial Attacks Using an Affine-Invariant Gradient Estimator

arXiv (Cornell University) (2021)

Wenzhao Xiang Hang Su Chang Liu Yandong Guo Shibao Zheng

As designers of artificial intelligence try to outwit hackers, both sides continue to hone in on AI's inherent vulnerabilities. Designed and trained from certain statistical distributions of data, AI's deep neural networks (DNNs) remain vulnerable to deceptive inputs that violate a DNN's statistical, predictive assumptions. Before being fed into a neural network, however, most existing adversarial examples cannot maintain malicious functionality when applied to an affine transformation. For practical purposes, maintaining that malicious functionality serves as an important measure of the robustness of adversarial attacks. To help DNNs learn to defend themselves more thoroughly against attacks, we propose an affine-invariant adversarial attack, which can consistently produce more robust adversarial examples over affine transformations. For efficiency, we propose to disentangle current affine-transformation strategies from the Euclidean geometry coordinate plane with its geometric translations, rotations and dilations; we reformulate the latter two in polar coordinates. Afterwards, we construct an affine-invariant gradient estimator by convolving the gradient at the original image with derived kernels, which can be integrated with any gradient-based attack methods. Extensive experiments on ImageNet, including some experiments under physical condition, demonstrate that our method can significantly improve the affine invariance of adversarial examples and, as a byproduct, improve the transferability of adversarial examples, compared with alternative state-of-the-art methods.

Robustness

Deep Neural Networks

10.48550/arxiv.2109.05820

Cite

Citations (0)

You Cannot Easily Catch Me: A Low-Detectable Adversarial Patch for Object Detectors

arXiv (Cornell University) (2021)

Zijian Zhu Hang Su Chang Liu Wenzhao Xiang Shibao Zheng

Blind spots or outright deceit can bedevil and deceive machine learning models. Unidentified objects such as digital "stickers," also known as adversarial patches, can fool facial recognition systems, surveillance systems and self-driving cars. Fortunately, most existing adversarial patches can be outwitted, disabled and rejected by a simple classification network called an adversarial patch detector, which distinguishes adversarial patches from original images. An object detector classifies and predicts the types of objects within an image, such as by distinguishing a motorcyclist from the motorcycle, while also localizing each object's placement within the image by "drawing" so-called bounding boxes around each object, once again separating the motorcyclist from the motorcycle. To train detectors even better, however, we need to keep subjecting them to confusing or deceitful adversarial patches as we probe for the models' blind spots. For such probes, we came up with a novel approach, a Low-Detectable Adversarial Patch, which attacks an object detector with small and texture-consistent adversarial patches, making these adversaries less likely to be recognized. Concretely, we use several geometric primitives to model the shapes and positions of the patches. To enhance our attack performance, we also assign different weights to the bounding boxes in terms of loss function. Our experiments on the common detection dataset COCO as well as the driving-video dataset D2-City show that LDAP is an effective attack method, and can resist the adversarial patch detector.

Bounding overwatch

10.48550/arxiv.2109.15177

Cite

Citations (1)

A Comprehensive Study on Robustness of Image Classification Models: Benchmarking and Rethinking

International Journal of Computer Vision (2024)

Chang Liu Yinpeng Dong Wenzhao Xiang Xiao Yang Hang Su

Robustness

Benchmarking

10.1007/s11263-024-02196-3

Cite

Citations (19)

Unrestricted Adversarial Attacks on ImageNet Competition

arXiv (Cornell University) (2021)

Yuefeng Chen Xiaofeng Mao Yuan He Hui Xue Chao Li

Many works have investigated the adversarial attacks or defenses under the settings where a bounded and imperceptible perturbation can be added to the input. However in the real-world, the attacker does not need to comply with this restriction. In fact, more threats to the deep model come from unrestricted adversarial examples, that is, the attacker makes large and visible modifications on the image, which causes the model classifying mistakenly, but does not affect the normal observation in human perspective. Unrestricted adversarial attack is a popular and practical direction but has not been studied thoroughly. We organize this competition with the purpose of exploring more effective unrestricted adversarial attack algorithm, so as to accelerate the academical research on the model robustness under stronger unbounded attacks. The competition is held on the TianChi platform (\url{https://tianchi.aliyun.com/competition/entrance/531853/introduction}) as one of the series of AI Security Challengers Program.

Robustness

10.48550/arxiv.2110.09903

Cite

Citations (6)

A Comprehensive Study on Robustness of Image Classification Models: Benchmarking and Rethinking

arXiv (Cornell University) (2023)

Chang Liu Yinpeng Dong Wenzhao Xiang Xiao Yang Hang Su

The robustness of deep neural networks is usually lacking under adversarial examples, common corruptions, and distribution shifts, which becomes an important research problem in the development of deep learning. Although new deep learning methods and robustness improvement techniques have been constantly proposed, the robustness evaluations of existing methods are often inadequate due to their rapid development, diverse noise patterns, and simple evaluation metrics. Without thorough robustness evaluations, it is hard to understand the advances in the field and identify the effective methods. In this paper, we establish a comprehensive robustness benchmark called \textbf{ARES-Bench} on the image classification task. In our benchmark, we evaluate the robustness of 55 typical deep learning models on ImageNet with diverse architectures (e.g., CNNs, Transformers) and learning algorithms (e.g., normal supervised training, pre-training, adversarial training) under numerous adversarial attacks and out-of-distribution (OOD) datasets. Using robustness curves as the major evaluation criteria, we conduct large-scale experiments and draw several important findings, including: 1) there is an inherent trade-off between adversarial and natural robustness for the same model architecture; 2) adversarial training effectively improves adversarial robustness, especially when performed on Transformer architectures; 3) pre-training significantly improves natural robustness based on more training data or self-supervised learning. Based on ARES-Bench, we further analyze the training tricks in large-scale adversarial training on ImageNet. By designing the training settings accordingly, we achieve the new state-of-the-art adversarial robustness. We have made the benchmarking results and code platform publicly available.

Robustness

Benchmarking

10.48550/arxiv.2302.14301

Cite

Citations (4)

Competition on robust deep learning

National Science Review (2023)

Yinpeng Dong Chang Liu Wenzhao Xiang Hang Su Jun Zhu

This perspective paper proposes a new adversarial training method based on large-scale pre-trained models to achieve state-of-the-art adversarial robustness on ImageNet.

Robustness

10.1093/nsr/nwad087

Cite

Citations (1)

AEMIM: Adversarial Examples Meet Masked Image Modeling

arXiv (Cornell University) (2024)

Wenzhao Xiang Chang Liu Hang Su Hongyang Yu

Masked image modeling (MIM) has gained significant traction for its remarkable prowess in representation learning. As an alternative to the traditional approach, the reconstruction from corrupted images has recently emerged as a promising pretext task. However, the regular corrupted images are generated using generic generators, often lacking relevance to the specific reconstruction task involved in pre-training. Hence, reconstruction from regular corrupted images cannot ensure the difficulty of the pretext task, potentially leading to a performance decline. Moreover, generating corrupted images might introduce an extra generator, resulting in a notable computational burden. To address these issues, we propose to incorporate adversarial examples into masked image modeling, as the new reconstruction targets. Adversarial examples, generated online using only the trained models, can directly aim to disrupt tasks associated with pre-training. Therefore, the incorporation not only elevates the level of challenge in reconstruction but also enhances efficiency, contributing to the acquisition of superior representations by the model. In particular, we introduce a novel auxiliary pretext task that reconstructs the adversarial examples corresponding to the original images. We also devise an innovative adversarial attack to craft more suitable adversarial examples for MIM pre-training. It is noted that our method is not restricted to specific model architectures and MIM strategies, rendering it an adaptable plug-in capable of enhancing all MIM methods. Experimental findings substantiate the remarkable capability of our approach in amplifying the generalization and robustness of existing MIM methods. Notably, our method surpasses the performance of baselines on various tasks, including ImageNet, its variants, and other downstream tasks.

10.48550/arxiv.2407.11537

Cite

Citations (0)

Improving the Robustness of Adversarial Attacks Using an Affine-Invariant Gradient Estimator

SSRN Electronic Journal (2022)

Wenzhao Xiang Hang Su Chang Liu Yandong Guo Shibao Zheng

As designers of artificial intelligence try to outwit hackers, both sides continue to hone in on AI's inherent vulnerabilities. Designed and trained from certain statistical distributions of data, AI's deep neural networks (DNNs) remain vulnerable to deceptive inputs that violate a DNN’s statistical, predictive assumptions. Before being fed into a neural network, however, most existing adversarial examples cannot maintain malicious functionality when applied to an affine transformation. For practical purposes, maintaining that malicious functionality serves as an important measure of the robustness of adversarial attacks. To help DNNs learn to defend themselves more thoroughly against attacks, we propose an affine-invariant adversarial attack, which can consistently produce more robust adversarial examples over affine transformations. For efficiency, we propose to disentangle current affine-transformation strategies from the Euclidean geometry coordinate plane with its geometric translations, rotations and dilations; we reformulate the latter two in polar coordinates. Afterwards, we construct an affine-invariant gradient estimator by convolving the gradient at the original image with derived kernels, which can be integrated with any gradient-based attack methods. Extensive experiments on ImageNet, including some experiments under physical condition, demonstrate that our method can significantly improve the affine invariance of adversarial examples and, as a byproduct, improve the transferability of adversarial examples, compared with alternative state-of-the-art methods.

Robustness

10.2139/ssrn.4095198

Cite

Citations (1)