Deep Graph Neural Networks for Malware Detection Using Ghidra P-Code

Rinaldo Iorizzo,Bo Yuan
DOI: https://doi.org/10.34190/eccws.23.1.2344
2024-06-21
Abstract:This work examines the effectiveness of using Ghidra P-Code as semantics-based features in a graph neural network-based malware detection system. A preliminary model exhibits a function level precision of ∼70% and a recall around ∼60%, and a precision and recall of ~55% and ~80% respectively for the program level detection task on a dataset of ∼50,000 control flow graphs extracted from functions of malicious and benign programs. Future improvements to this ongoing project include, but are not limited to, collecting dynamic control flow graph information as opposed to static graphs to provide the model with resilience to advanced malware obfuscation and encryption schemes.
What problem does this paper attempt to address?