Android Malware Clustering through Malicious Payload Mining

Yuping Li,Jiyong Jang,Xin Hu,Xinming Ou
DOI: https://doi.org/10.48550/arXiv.1707.04795
2017-07-15
Cryptography and Security
Abstract:Clustering has been well studied for desktop malware analysis as an effective triage method. Conventional similarity-based clustering techniques, however, cannot be immediately applied to Android malware analysis due to the excessive use of third-party libraries in Android application development and the widespread use of repackaging in malware development. We design and implement an Android malware clustering system through iterative mining of malicious payload and checking whether malware samples share the same version of malicious payload. Our system utilizes a hierarchical clustering technique and an efficient bit-vector format to represent Android apps. Experimental results demonstrate that our clustering approach achieves precision of 0.90 and recall of 0.75 for Android Genome malware dataset, and average precision of 0.98 and recall of 0.96 with respect to manually verified ground-truth.
What problem does this paper attempt to address?