Bidirectional IndRNN malicious webpages detection algorithm based on convolutional neural network and attention mechanism.

Huan-huan Wang, Sheng-Wei Tian, Long Yu, Xian-Xian Wang, Qing-Shan Qi, Ji-Hong Chen
DOI: https://doi.org/10.3233/JIFS-190455
2020-01-01
Journal of Intelligent & Fuzzy Systems
Abstract: This paper proposes CATTB, a parallel joint model that combines a convolutional neural network (CNN) with an attention mechanism and a bidirectional independently recurrent neural network (BiIndRNN). The algorithm extracts the relocation feature and the "texture fingerprint" feature, which are used to express the similarity of the URL (Uniform Resource Locator) binary file content of malicious web pages. It also uses the word vector tool word2vec to train URL word vector features and extracts URL static vocabulary features. First, the CNN extracts deep local features. Second, the attention mechanism adjusts the weights and the BiIndRNN extracts global features. Finally, softmax is used for classification. In this way, features are extracted more comprehensively, from different angles and with different methods. Experimental results show that the test results exceed those reported by other researchers and that, compared with other algorithms, the proposed CATTB algorithm improves the accuracy of malicious web page detection.
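The abstract does not spell out the BiIndRNN recurrence, so the following is only a minimal sketch of the standard IndRNN update, h_t = relu(W x_t + u * h_{t-1} + b), in which the recurrent weight u is a vector applied elementwise so that each hidden unit depends only on its own previous state. The function names (`indrnn_pass`, `bi_indrnn`), the parameter layout, and the choice to concatenate the two directions are assumptions for illustration, not details taken from the paper.

```python
from typing import List, Tuple

Vector = List[float]
Matrix = List[List[float]]
Params = Tuple[Matrix, Vector, Vector]  # (input weights W, recurrent weights u, bias b)


def relu(value: float) -> float:
    """Rectified linear unit, the activation commonly paired with IndRNN."""
    return value if value > 0.0 else 0.0


def indrnn_pass(inputs: List[Vector], w: Matrix, u: Vector, b: Vector) -> List[Vector]:
    """Run one direction of an IndRNN layer over a sequence.

    Each hidden unit i is updated as
        h_t[i] = relu(sum_j W[i][j] * x_t[j] + u[i] * h_{t-1}[i] + b[i])
    so the recurrent connection is elementwise rather than a full matrix.
    """
    hidden = [0.0] * len(u)  # initial hidden state is all zeros
    states = []
    for x in inputs:
        hidden = [
            relu(sum(w_ij * x_j for w_ij, x_j in zip(w[i], x)) + u[i] * hidden[i] + b[i])
            for i in range(len(u))
        ]
        states.append(hidden)
    return states


def bi_indrnn(inputs: List[Vector], fwd: Params, bwd: Params) -> List[Vector]:
    """Concatenate forward and backward IndRNN states at every time step.

    Concatenation is an assumed way of merging the two directions.
    """
    forward = indrnn_pass(inputs, *fwd)
    # Process the reversed sequence, then flip the states back into input order.
    backward = indrnn_pass(inputs[::-1], *bwd)[::-1]
    return [f + r for f, r in zip(forward, backward)]
```

In a pipeline like the one the abstract describes, `inputs` would be the attention-weighted feature sequence produced by the CNN stage, and the concatenated states would feed the softmax classifier; those stages are omitted here.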