Identifying Authorship in Malicious Binaries: Features, Challenges & Datasets

Jason Gray,Daniele Sgandurra,Lorenzo Cavallaro,Jorge Blasco
DOI: https://doi.org/10.1145/3653973
IF: 16.6
2024-03-26
ACM Computing Surveys
Abstract:Attributing a piece of malware to its creator typically requires threat intelligence. Binary attribution increases the level of difficulty as it mostly relies upon the ability to disassemble binaries to obtain authorship-related features. We perform a systematic analysis of works in the area of malware authorship attribution. We identify key findings, some shortcomings of current approaches and explore the open research challenges. To mitigate the lack of ground truth datasets in this domain, we publish alongside this survey the largest and most diverse meta-information dataset of 17,513 malware labeled to 275 threat actor groups.
computer science, theory & methods
What problem does this paper attempt to address?