Physical Backdoor Attack can Jeopardize Driving with Vision-Large-Language Models

Zhenyang Ni,Rui Ye,Yuxi Wei,Zhen Xiang,Yanfeng Wang,Siheng Chen

2024-04-22

Abstract:Vision-Large-Language-models(VLMs) have great application prospects in autonomous driving. Despite the ability of VLMs to comprehend and make decisions in complex scenarios, their integration into safety-critical autonomous driving systems poses serious security risks. In this paper, we propose BadVLMDriver, the first backdoor attack against VLMs for autonomous driving that can be launched in practice using physical objects. Unlike existing backdoor attacks against VLMs that rely on digital modifications, BadVLMDriver uses common physical items, such as a red balloon, to induce unsafe actions like sudden acceleration, highlighting a significant real-world threat to autonomous vehicle safety. To execute BadVLMDriver, we develop an automated pipeline utilizing natural language instructions to generate backdoor training samples with embedded malicious behaviors. This approach allows for flexible trigger and behavior selection, enhancing the stealth and practicality of the attack in diverse scenarios. We conduct extensive experiments to evaluate BadVLMDriver for two representative VLMs, five different trigger objects, and two types of malicious backdoor behaviors. BadVLMDriver achieves a 92% attack success rate in inducing a sudden acceleration when coming across a pedestrian holding a red balloon. Thus, BadVLMDriver not only demonstrates a critical security risk but also emphasizes the urgent need for developing robust defense mechanisms to protect against such vulnerabilities in autonomous driving technologies.

Cryptography and Security

What problem does this paper attempt to address?

This paper presents a practical physical backdoor attack called BadVLMDriver targeting Visual-Linguistic Models (VLMs) used in autonomous driving. Although VLMs show potential in understanding and decision-making in complex scenarios, integrating them into critical safety systems such as autonomous driving brings serious security risks. BadVLMDriver is the first backdoor attack that exploits everyday physical objects (e.g. red balloons) to induce dangerous behaviors (e.g. sudden acceleration), revealing real threats to the security of autonomous driving technology. The attack consists of two steps: first, generating backdoor training samples containing malicious behaviors through natural language instructions, which are composed of images edited by diffusion model and text responses modified by large-scale language models; second, fine-tuning victim VLMs on the generated backdoors and benign samples through visual instructions. This process reduces manual work, enhances the concealment and practicality of the attack. Experiments show that BadVLMDriver can induce vehicle acceleration with a success rate of 92% when encountering pedestrians holding red balloons. This not only demonstrates serious safety risks but also emphasizes the need to develop robust defense mechanisms against such vulnerabilities. The paper also discusses the flexibility and efficiency advantages of physical backdoor attacks compared to existing digital backdoor attacks targeting VLMs. In conclusion, the paper aims to reveal the potential security issues of VLMs in autonomous driving applications and proposes a feasible physical backdoor attack scheme, calling for attention to and strengthening of security measures for autonomous driving technology.

Physical Backdoor Attack can Jeopardize Driving with Vision-Large-Language Models

Clean-Annotation Backdoor Attack against Lane Detection Systems in the Wild

Adversarial Computer Vision Via Acoustic Manipulation of Camera Sensors

PLA-LiDAR: Physical Laser Attacks Against LiDAR-based 3D Object Detection in Autonomous Vehicle.

Adversarial Robustness Analysis of LiDAR-included Models in Autonomous Driving

Physical Backdoor Attacks to Lane Detection Systems in Autonomous Driving

Robust Roadside Physical Adversarial Attack Against Deep Learning in Lidar Perception Modules

Can We Trust Embodied Agents? Exploring Backdoor Attacks against Embodied LLM-based Decision-Making Systems

A First Physical-World Trajectory Prediction Attack via LiDAR-induced Deceptions in Autonomous Driving

Stealthy and Effective Physical Adversarial Attacks in Autonomous Driving

Towards Robust Physical-world Backdoor Attacks on Lane Detection

Attacking vision-based perception in end-to-end autonomous driving models

Driving into Danger: Adversarial Patch Attack on End-to-End Autonomous Driving Systems Using Deep Learning

You Can't See Me: Physical Removal Attacks on LiDAR-based Autonomous Vehicles Driving Frameworks

Physical Backdoor Trigger Activation of Autonomous Vehicle using Reachability Analysis

Targeted Attack on Deep RL-based Autonomous Driving with Learned Visual Patterns

Adversarial Sensor Attack on LiDAR-based Perception in Autonomous Driving

Partial-Information, Longitudinal Cyber Attacks on LiDAR in Autonomous Vehicles

Too Afraid to Drive: Systematic Discovery of Semantic DoS Vulnerability in Autonomous Driving Planning under Physical-World Attacks

Backdoor Attacks Against Deep Learning Systems in the Physical World

Towards Robust LiDAR-based Perception in Autonomous Driving: General Black-box Adversarial Sensor Attack and Countermeasures