FF-BCSD: A Binary Code Similarity Detection Method Based on Feature Fusion

Shucen Liu, Cheng Yu, Lianqiang Wang, Chenyu Wang, Guosheng Xu
DOI: https://doi.org/10.1109/iccc59590.2023.10507703
2023-01-01
Abstract: With the explosive growth of Internet of Things (IoT) devices, software engineers often reuse third-party code. If the reused code contains a security vulnerability, every downstream IoT component that reuses it inherits the same vulnerability. To address this problem, we propose a binary code similarity detection method based on feature fusion. We construct features for instruction sequences and control flow graphs (CFGs) using deep learning models. These two features are fused to form a deep feature representation of each function, which is used for similarity measurement. Compared to Gemini and VulSeeker, our method achieves better accuracy, precision, recall, and F1 score. In addition, the experimental results show that the fused deep feature representation outperforms either single feature in different scenarios.
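The fusion step described in the abstract can be illustrated with a minimal sketch. The abstract does not state how the two features are fused or which similarity metric is used, so everything below is an assumption made for illustration: fusion is taken to be concatenation of L2-normalized vectors, similarity is taken to be cosine similarity, and the names `fuse` and `function_similarity` are hypothetical. The embeddings themselves, which the paper derives from deep learning models, are stood in for by plain lists of floats.

```python
import math


def l2_normalize(vec):
    """Scale a vector to unit length; a zero vector is returned unchanged."""
    norm = math.sqrt(sum(x * x for x in vec))
    if norm == 0.0:
        return list(vec)
    return [x / norm for x in vec]


def fuse(seq_embedding, cfg_embedding):
    """Fuse two per-function features into one vector.

    ASSUMPTION: fusion is plain concatenation of the normalized
    instruction-sequence and CFG embeddings. The paper's actual
    fusion operator is not given in the abstract.
    """
    return l2_normalize(seq_embedding) + l2_normalize(cfg_embedding)


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def function_similarity(seq_a, cfg_a, seq_b, cfg_b):
    """Score two binary functions by comparing their fused features.

    ASSUMPTION: cosine similarity stands in for whatever similarity
    measurement the paper uses.
    """
    return cosine_similarity(fuse(seq_a, cfg_a), fuse(seq_b, cfg_b))
```

Under these assumptions, two functions whose instruction-sequence embeddings agree but whose CFG embeddings are orthogonal score halfway between a full match and no match, which shows how each feature contributes to the fused comparison.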