BovdGFE: Buffer Overflow Vulnerability Detection Based on Graph Feature Extraction
Lv Xinghang,Peng Tao,Chen Jia,Liu Junping,Hu Xinrong,He Ruhan,Jiang Minghua,Cao Wenli
DOI: https://doi.org/10.1007/s10489-022-04214-8
IF: 5.3
2022-01-01
Applied Intelligence
Abstract:Automatically detecting buffer overflow vulnerabilities is an important research topic in software security. Recent studies have shown that vulnerability detection performance utilizing deep learning-based techniques can be significantly enhanced. However, due to information loss during code representation, existing approaches cannot learn the features associated with vulnerabilities, leading to a high false negative rate (FNR) and low precision. To resolve the existing problems, we propose a method for buffer overflow vulnerability detection based on graph feature extraction (BovdGFE) in C/C++ programs. BovdGFE constructs the buffer overflow function samples. Then, we present a new representation structure, code representation sequence (CoRS), which incorporates the control flow, data dependencies, and syntax structure of the vulnerable code for reducing information loss during code representation. After the function samples are transformed into CoRS, a deep learning model is used to learn vulnerable features and perform vulnerability classification. The results of the experiments show that BovdGFE improves the precision and FNR by 6.3 % and 3.9 % respectively compared with state-of-the-art methods, which can significantly improve the capability of vulnerability detection.