Using Extended Character Feature in Bi-LSTM for DGA Domain Name Detection

Rui Pan,Jian Chen,Hanyuan Ma,Xiaoshan Bai
DOI: https://doi.org/10.1109/icis54925.2022.9882343
2022-01-01
Abstract:The detection of DGA domain name has become a central topic in network security in recent years. With DGA (Domain Generation Algorithm), a large number of pseudo-random domain names could be generated and used in DDoS (Distributed Denial of Service) attacks, reflection amplification attacks and other network attacks. Traditional deep learning algorithms based on character feature have the advantages of automatic feature extraction and shorter training time in DGA domain name detection. But for wordlist-based DGA domain name, the accuracy is lower. In order to improve the detection accuracy, semantic features of domain name are extracted to extend character features for the first time. Experiment results show that using extended character feature in Bi-LSTM (Bi-Directional Long Short-Term Memory) could improve the detection performance. In multi-classification, micro average of F1 score reaches up to 98.72%.
What problem does this paper attempt to address?