Obfuscated Code is Identifiable by a Token-Based Code Clone Detection Technique.

Junaid Akram, Danish Vasan, Ping Luo
DOI: https://doi.org/10.1504/ijics.2022.127132
2022-01-01
International Journal of Information and Computer Security
Abstract: Developers, and malware authors in particular, increasingly use obfuscation techniques to make their code difficult to understand or analyse. When an Android application is obfuscated, it is hard to recover the exact source code by reverse engineering. In this paper, we propose an approach based on a clone detection technique that detects obfuscated code in Android applications efficiently. We perform two experiments on different datasets that include both obfuscated and non-obfuscated application source code. We detect two types of obfuscation, identifier renaming and string encryption, with an accuracy of 95%. A comparative study with other state-of-the-art tools demonstrates the efficiency of the proposed approach. Experimental results show that the approach is reliable, efficient, and applicable at large scale.
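The abstract does not describe the mechanics of the approach, so the following is only a minimal sketch of the general idea behind token-based clone detection, not the authors' method. It assumes a simplified regex tokenizer, a small hand-picked keyword set, and Jaccard similarity over token trigrams; the names `normalize`, `similarity`, and `JAVA_KEYWORDS` are hypothetical. It illustrates why identifier renaming alone does not hide a clone once identifiers are abstracted to a placeholder token.

```python
import re

# Assumption: a tiny keyword subset, enough for the illustration below.
JAVA_KEYWORDS = {
    "int", "void", "return", "if", "else", "for", "while",
    "class", "public", "private", "static", "new", "String",
}

# String literals, identifiers/keywords, integer literals, or any single symbol.
TOKEN_RE = re.compile(r'"(?:\\.|[^"\\])*"|[A-Za-z_]\w*|\d+|\S')


def normalize(source):
    """Tokenize source and abstract identifiers and string literals."""
    tokens = []
    for tok in TOKEN_RE.findall(source):
        if tok.startswith('"'):
            tokens.append("STR")   # collapse every string literal
        elif re.match(r"[A-Za-z_]", tok) and tok not in JAVA_KEYWORDS:
            tokens.append("ID")    # collapse every non-keyword identifier
        else:
            tokens.append(tok)     # keep keywords, numbers, and symbols
    return tokens


def similarity(a, b, n=3):
    """Jaccard similarity over n-grams of the normalized token streams."""
    def grams(toks):
        return {tuple(toks[i:i + n]) for i in range(len(toks) - n + 1)}

    ga, gb = grams(normalize(a)), grams(normalize(b))
    if not ga and not gb:
        return 1.0
    return len(ga & gb) / len(ga | gb)


original = "int computeTotal(int price, int count) { return price * count; }"
renamed = "int a(int b, int c) { return b * c; }"
unrelated = 'void log(String msg) { if (msg != null) print("x" + msg); }'

print(similarity(original, renamed))    # identical after normalization
print(similarity(original, unrelated))  # low overlap
```

Because the renamed method normalizes to the same token sequence as the original, the two are reported as clones. Handling string encryption would need more than collapsing literals, since encrypted strings are usually wrapped in a decryption call; that is outside the scope of this sketch.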