The Method of Malware Detection based on Information Gain Characteristics Optimization Select

Wang Chang-Zhi,Liang Gang,Yang Jin,Chen Wen
DOI: https://doi.org/10.3969/j.issn.1671-0428.2013.04.004
2013-01-01
Abstract:API call sequence was a commonly method used into malware detection and classification,but there is a lack of efficacious ways to choose significant API sequences as detection features.Besides,redundancies exist among chosen features.All these problems lead to low detection accuracy.This paper come up with and implement a malware detection method which optimize the selection of features based on information gain.When choosing API sequences,not only the information gain but also the frequency and the degree of concentration of API are taken into consideration in our method in order to select features more contributive to classification.Our experiment has showed that this method has a high malware detection rate and can maintain low false alarm rate of normal software.
What problem does this paper attempt to address?