Abstract: Deep neural networks (DNNs) are vulnerable to adversarial noises, which motivates the benchmark of model robustness. Existing benchmarks mainly focus on evaluating the defenses, but there are no comprehensive studies of how architecture design and general training techniques affect robustness. Comprehensively benchmarking their relationships will be highly beneficial for better understanding and developing robust DNNs. Thus, we propose RobustART, the first comprehensive Robustness investigation benchmark on ImageNet (including open-source toolkit, pre-trained model zoo, datasets, and analyses) regarding ARchitecture design (44 human-designed off-the-shelf architectures and 1200+ networks from neural architecture search) and Training techniques (10+ general techniques, e.g., data augmentation) towards diverse noises (adversarial, natural, and system noises). Extensive experiments revealed and substantiated several insights for the first time, for example: (1) adversarial training largely improves the clean accuracy and all types of robustness for Transformers and MLP-Mixers; (2) with comparable sizes, CNNs > Transformers > MLP-Mixers on robustness against natural and system noises; Transformers > MLP-Mixers > CNNs on adversarial robustness; (3) for some light-weight architectures (e.g., EfficientNet, MobileNetV2, and MobileNetV3), increasing model sizes or using extra training data cannot improve robustness. Our benchmark http://robust.art/ : (1) presents an open-source platform for conducting comprehensive evaluation on diverse robustness types; (2) provides a variety of pre-trained models with different training techniques to facilitate robustness evaluation; (3) proposes a new view to better understand the mechanism towards designing robust DNN architectures, backed up by the analysis. We will continuously contribute to building this ecosystem for the community.

Are Transformer-Based Models More Robust Than CNN-based Models?

Can CNNs Be More Robust Than Transformers?

Are Transformers More Robust? Towards Exact Robustness Verification for Transformers

ProTransformer: Robustify Transformers via Plug-and-Play Paradigm

The Efficacy of Transformer-based Adversarial Attacks in Security Domains

On the Adversarial Robustness of Vision Transformers

A Comprehensive Study on Robustness of Image Classification Models: Benchmarking and Rethinking

On the Robustness of Vision Transformers to Adversarial Examples

ROBY: Evaluating the adversarial robustness of a deep model by its decision boundaries

Benchmarking the Robustness of Spatial-Temporal Models Against Corruptions

ConvNet vs Transformer, Supervised vs CLIP: Beyond ImageNet Accuracy

Revealing the Dark Secrets of Extremely Large Kernel ConvNets on Robustness

Robust crack detection in masonry structures with Transformers

Predicted Robustness as QoS for Deep Neural Network Models

Global Clipper: Enhancing Safety and Reliability of Transformer-based Object Detection Models

Enhancing the robustness of vision transformer defense against adversarial attacks based on squeeze-and-excitation module

Large-scale Robustness Analysis of Video Action Recognition Models

RobustART: Benchmarking Robustness on Architecture Design and Training Techniques

Out of Distribution Performance of State of Art Vision Model

Robustness and Transferability of Adversarial Attacks on Different Image Classification Neural Networks