Research on Malicious JavaScript Detection Technology Based on LSTM

Yong Fang,Cheng Huang,Liang Liu,Min Xue
DOI: https://doi.org/10.1109/access.2018.2874098
IF: 3.9
2018-01-01
IEEE Access
Abstract:The attacker injects malicious JavaScript into web pages to achieve the purpose of implanting Trojan horses, spreading viruses, phishing, and obtaining secret information. By analyzing the existing researches on malicious JavaScript detection, a malicious JavaScript detection model based on LSTM (Long Short-Term Memory) is proposed. Features are extracted from the semantic level of bytecode, and the method of word vector is optimized. It can distinguish malicious JavaScript code and combat obfuscated code effectively. Experiments showed that the accuracy of detection model based on LSTM is 99.51%, and the F1-score is 98.37%, which is better than the existing model based on Random Forest and SVM algorithm.
What problem does this paper attempt to address?