Abstract:The security guarantee of AI-enabled software systems (particularly using deep learning techniques as a functional core) is pivotal against the adversarial attacks exploiting software vulnerabilities. However, little attention has been paid to a systematic investigation of vulnerabilities in such systems. A common situation learned from the open source software community is that deep learning engineers frequently integrate off-the-shelf or open-source learning frameworks into their ecosystems. In this work, we specifically look into deep learning (DL) framework and perform the first systematic study of vulnerabilities in DL systems through a comprehensive analysis of identified vulnerabilities from Common Vulnerabilities and Exposures (CVE) and open-source DL tools, including TensorFlow, Caffe, OpenCV, Keras, and PyTorch. We propose a two-stream data analysis framework to explore vulnerability patterns from various databases. We investigate the unique DL frameworks and libraries development ecosystems that appear to be decentralized and fragmented. By revisiting the Common Weakness Enumeration (CWE) List, which provides the traditional software vulnerability related practices, we observed that it is more challenging to detect and fix the vulnerabilities throughout the DL systems lifecycle. Moreover, we conducted a large-scale empirical study of 3,049 DL vulnerabilities to better understand the patterns of vulnerability and the challenges in fixing them. We have released the full replication package at <a class="link-external link-https" href="https://github.com/codelzz/Vulnerabilities4DLSystem" rel="external noopener nofollow">this https URL</a>. We anticipate that our study can advance the development of secure DL systems.

What problem does this paper attempt to address?

The problems that this paper attempts to solve are the security weaknesses and vulnerabilities in deep - learning systems. Specifically, the author focuses on the widely - existing security vulnerability problems in deep - learning frameworks and hopes to reveal the patterns, root causes and repair challenges of these vulnerabilities through systematic research. The following is a summary of the core issues in the paper: ### 1. **Identifying and Analyzing Vulnerabilities in Deep - Learning Systems** - **Problem Description**: Although deep - learning technology has made remarkable progress in multiple fields, its security issues have not been fully emphasized. Especially in critical applications (such as aviation flight control and self - driving), once security vulnerabilities occur, they may bring serious consequences. - **Research Motivation**: Existing research mainly focuses on vulnerability analysis of general - purpose software systems, while relatively little research has been done on the security weaknesses and vulnerabilities of deep - learning systems. Therefore, this paper aims to fill this gap by conducting a systematic study of the vulnerabilities in commonly - used deep - learning frameworks (such as TensorFlow, Caffe, OpenCV, PyTorch and Keras). ### 2. **Exploring the Root Causes and Symptoms of Vulnerabilities** - **Problem Description**: In order to better understand the vulnerabilities in deep - learning systems, it is necessary to deeply analyze the root causes and manifestations of these vulnerabilities. This helps to develop more effective detection and repair methods. - **Research Content**: The author explored the root causes and symptoms of vulnerabilities by manually analyzing 3,049 vulnerabilities and extracting data from more than 10,360 code commits and 443 official CVE records, and proposed a classification system. ### 3. **Discussing the Challenges of Detecting and Repairing Vulnerabilities** - **Problem Description**: Compared with traditional software, the detection and repair of vulnerabilities in deep - learning systems face more challenges. For example, due to the complexity and dynamics of deep - learning systems, traditional vulnerability detection tools and methods may not be applicable. - **Research Content**: The author discussed the challenges faced in detecting and repairing vulnerabilities in deep - learning systems and proposed future research directions to improve the methods of vulnerability detection and repair. ### 4. **Providing Reproducible Research Resources** - **Problem Description**: In order to make the research results more credible and practical, the author hopes to provide reproducible research resources for other researchers and practitioners. - **Research Content**: The author open - sourced the source code, data set and analysis results for software security researchers and practitioners to use. All resources can be found on GitHub. ### Summary The main goal of this paper is to promote the development of secure deep - learning systems by comprehensively and systematically analyzing the vulnerabilities in deep - learning systems, revealing their root causes, manifestations and repair challenges. The author hopes that through this research, people's awareness of the security of deep - learning systems can be improved and valuable references can be provided for future vulnerability detection and repair. --- If you have more specific questions or need further assistance, please feel free to let me know!

On Security Weaknesses and Vulnerabilities in Deep Learning Systems

Security Risks in Deep Learning Implementations

Review: Deep Learning Methods for Cybersecurity and Intrusion Detection Systems

Towards Security Threats of Deep Learning Systems: A Survey

Deep Learning based Vulnerability Detection: Are We There Yet?

Deep Learning Under Siege: Identifying Security Vulnerabilities and Risk Mitigation Strategies

Toward Improved Deep Learning-based Vulnerability Detection

Survey of Source Code Vulnerability Analysis Based on Deep Learning

Software Vulnerability Detection Using Deep Neural Networks: A Survey

A Comparative Study of Deep Learning-Based Vulnerability Detection System

Towards Making Deep Learning-based Vulnerability Detectors Robust

A Survey on Automated Software Vulnerability Detection Using Machine Learning and Deep Learning

VulDeePecker: A Deep Learning-Based System for Vulnerability Detection

DEEPSEC: A Uniform Platform for Security Analysis of Deep Learning Model.

Software Vulnerability Mining and Analysis Based on Deep Learning

Security and Machine Learning in the Real World

Understanding and Tackling Label Errors in Deep Learning-Based Vulnerability Detection (experience Paper).

A Survey of Automatic Software Vulnerability Detection, Program Repair, and Defect Prediction Techniques

Characterizing and Understanding Software Security Vulnerabilities in Machine Learning Libraries

Adversarial Attacks and Defenses in Deep Learning: From a Perspective of Cybersecurity

Towards characterizing adversarial defects of deep learning software from the lens of uncertainty