Abstract:In the software engineering (SE) community, deep learning (DL) has recently been applied to many source code processing tasks, achieving state-of-the-art results. Due to the poor interpretability of DL models, their security vulnerabilities require scrutiny. Recently, researchers have identified an emergent security threat to DL models, namely poison attacks . The attackers aim to inject insidious backdoors into DL models by poisoning the training data with poison samples. The backdoors mean that poisoned models work normally with clean inputs but produce targeted erroneous results with inputs embedded with specific triggers. By using triggers to activate backdoors, attackers can manipulate poisoned models in security-related scenarios ( e.g., defect detection) and lead to severe consequences. To verify the vulnerability of deep source code processing models to poison attacks, we present a poison attack approach for source code named CodePoisoner as a strong imaginary enemy. CodePoisoner can produce compilable and functionality-preserving poison samples and effectively attack deep source code processing models by poisoning the training data with poison samples. To defend against poison attacks, we further propose an effective poison detection approach named CodeDetector . CodeDetector can automatically identify poison samples in the training data. We apply CodePoisoner and CodeDetector to six deep source code processing models, including defect detection, clone detection, and code repair models. The results show that 1 CodePoisoner conducts successful poison attacks with a high attack success rate (avg: 98.3%, max: 100%). It validates that existing deep source code processing models have a strong vulnerability to poison attacks. 2 CodeDetector effectively defends against multiple poison attack approaches by detecting (max: 100%) poison samples in the training data. We hope this work can help SE researchers and practitioners notice poison attacks and inspire the design of more advanced defense techniques.

Model Poisoning Attack Against Neural Network Interpreters in IoT Devices

Data Poisoning Attacks in Internet-of-Vehicle Networks: Taxonomy, State-of-The-Art, and Future Directions.

Adversarial Example Attacks in Internet of Things (IoT)

Online data poisoning attack against edge AI paradigm for IoT-enabled smart city

Poison Attack and Defense on Deep Source Code Processing Models

Towards Practical Deployment-Stage Backdoor Attack on Deep Neural Networks

Poisoning attacks and countermeasures in intelligent networks: Status quo and prospects

Neural Model Stealing Attack to Smart Mobile Device on Intelligent Medical Platform

Prediction Poisoning: Towards Defenses Against DNN Model Stealing Attacks

Poisoning Web-Scale Training Datasets is Practical

Concealed Data Poisoning Attacks on NLP Models

MetaPoison: Practical General-purpose Clean-label Data Poisoning

Poison Attack and Poison Detection on Deep Source Code Processing Models

Adversarial Attacks Against Network Intrusion Detection in IoT Systems

Disarming Steganography Attacks Inside Neural Network Models

Poisoning Attacks in Federated Edge Learning for Digital Twin 6G-enabled IoTs: An Anticipatory Study

Research on Data Poisoning Attack against Smart Grid Cyber–Physical System Based on Edge Computing

Attack-model-agnostic defense against model poisonings in distributed learning

A Tale of Evil Twins: Adversarial Inputs Versus Poisoned Models

SGBA: A Stealthy Scapegoat Backdoor Attack Against Deep Neural Networks

Fault Injection and Safe-Error Attack for Extraction of Embedded Neural Network Models