Cyberspace Attack Detection Based On Advanced Initialized Recurrent Neural Network

Chaopeng Li, Yiqiang Sheng, Jinlin Wang
DOI: https://doi.org/10.24507/ijicic.15.04.1413
2019-01-01
Abstract: Recurrent neural networks (RNNs) can be used to detect cyber attacks, but their training suffers from vanishing and exploding gradients. This study proposes an RNN model with an advanced weight initialization to alleviate these gradient problems. We treat attack detection as a classification task and adopt a hierarchical RNN and multilayer perceptron (MLP) model to identify attacks. We then analyze the causes of vanishing and exploding gradients. Based on the distribution of cyberspace data, we derive an improved weight initialization approach for RNNs. RNN outputs come in two formats: the last step only, or all steps. The initialization is fine-tuned for each format. On a public dataset, we compare the learning convergence of state-of-the-art initialization schemes proposed over the past three years and discuss the influence of different embedding methods. Experimental results show that the proposed initialization method achieves a lower error rate (a relative decrease of about 9%) than the other initializations.
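The gradient problem mentioned in the abstract can be illustrated with a minimal sketch. The Python example below is not the initialization proposed in the paper. It assumes a linear recurrence h_t = W h_{t-1} with no activation function and a hypothetical 2x2 recurrent matrix (a rotation scaled by `gain`), so that every singular value of W equals `gain`. The helper names `scaled_rotation` and `backprop_norm` are illustrative only.

```python
import math


def matvec(m, v):
    """Multiply a square matrix (list of rows) by a vector."""
    return [sum(a * b for a, b in zip(row, v)) for row in m]


def transpose(m):
    """Return the transpose of a matrix given as a list of rows."""
    return [list(col) for col in zip(*m)]


def scaled_rotation(gain, angle=0.3):
    """Hypothetical recurrent weight matrix: a 2x2 rotation scaled by `gain`.

    A rotation preserves vector length, so all singular values equal `gain`.
    """
    c, s = math.cos(angle), math.sin(angle)
    return [[gain * c, -gain * s], [gain * s, gain * c]]


def backprop_norm(w, steps, grad=(1.0, 0.0)):
    """Norm of a gradient propagated back `steps` time steps.

    Assumes the linear recurrence h_t = W h_{t-1}, for which each backward
    step multiplies the gradient by the transpose of W.
    """
    wt = transpose(w)
    g = list(grad)
    for _ in range(steps):
        g = matvec(wt, g)
    return math.hypot(*g)


if __name__ == "__main__":
    # Compare a gain below, at, and above 1 over 50 time steps.
    for gain in (0.9, 1.0, 1.1):
        print(gain, backprop_norm(scaled_rotation(gain), steps=50))
```

Under these assumptions the gradient norm after T steps equals `gain ** T`, so a gain slightly below 1 makes the gradient vanish and a gain slightly above 1 makes it explode. This is why the scale of the initial recurrent weights matters for training.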