Universal Adversarial Perturbations for Vision-Language Pre-trained Models

Peng-Fei Zhang, Zi Huang, Guangdong Bai
2024-05-09
Abstract: Vision-language pre-trained (VLP) models have been the foundation of numerous vision-language tasks. Given their prevalence, it becomes imperative to assess their adversarial robustness, especially when deploying them in security-crucial real-world applications. Traditionally, adversarial perturbations generated for this assessment target specific VLP models, datasets, and/or downstream tasks. This practice suffers from low transferability and additional computation costs when transitioning to new scenarios.
Computer Vision and Pattern Recognition, Multimedia
What problem does this paper attempt to address?