Diversity supporting robustness: Enhancing adversarial robustness via differentiated ensemble predictions

Xi Chen,Wei Huang,Ziwen Peng,Wei Guo,Fan Zhang
DOI: https://doi.org/10.1016/j.cose.2024.103861
IF: 5.105
2024-04-29
Computers & Security
Abstract:Deep learning models remain susceptible to adversarial attacks, prompting a primary focus on individual defense strategies. Previous studies have shown that increasing diversity among ensemble models can enhance their adversarial robustness. However, existing diversity metrics often lack a direct association with robustness, potentially limiting ensemble methods' effectiveness in countering adversarial attacks. This paper presents a new diversity training method, termed Enhancing Adversarial Robustness through Diversity that Supports Robustness (EADSR), which is strongly linked to increased adversarial robustness. We developed a safety space hypothesis and a simultaneous training method based on the models' responsiveness to non-natural perturbations which amplifies ensemble model diversity. . By categorizing the ensemble models' prediction behaviors into four classes, four regularization methods are applied for diversity training throughout the training process. We conducted comparative analysis with existing diversity methods to find a remarkable 30%–100% improvement on multiple benchmark datasets. The EADSR method incurs manageable training costs and is applicable to complex datasets such as ImageNet, demonstrating its effectiveness and practicality as a defense approach.
computer science, information systems
What problem does this paper attempt to address?