COCL: an Intelligent Framework for Enhancing Deep Learning-Based Vulnerability Detection

Wenxuan Li,Shihan Dou,Yueming Wu,Chenxi Li,Yang Liu
DOI: https://doi.org/10.1109/tii.2023.3329670
IF: 12.3
2024-01-01
IEEE Transactions on Industrial Informatics
Abstract:Due to the powerful feature extraction capability of deep learning (DL), many recent studies have used it to conduct source code vulnerability analysis. However, although it has a good performance on artificial datasets, it does not perform satisfactorily on the real-world vulnerabilities with higher complexity. In this article, we introduce contrastive curriculum learning into DL-based vulnerability detection to find a suitable boundary to distinguish vulnerabilities from normal codes. Contrastive learning can be used to reduce the difference between different vulnerabilities while amplifying the difference between vulnerabilities and normal codes. To make the training phase of contrastive learning more intelligent, we apply curriculum learning to mimic the way humans acquire knowledge, which means that the model will learn simple samples first and then increase the difficulty of training samples. Specifically, we implement an intelligent framework (i.e., contrastive curriculum learning (COCL) ) that can enhance the detection effect of existing DL-based vulnerability detectors. To verify the capability of COCL , we select four state-of-the-art DL-based vulnerability detectors (i.e., AutoVulTC , VulDeePecker , BenchSG , and Devign ) as our base models. The experimental results show that using COCL can bring an improvement of 8.1% to the F1 scores of these models on a real-world vulnerability dataset.
What problem does this paper attempt to address?