Dns Tunneling Detection with New Patterns Emerging: A Forest-Based Classifier with an Unknown Option

Shenfei Pei,feiping Nie,Rong Wang,Jun Wu,Qinglong Chang,Liang Zhang
DOI: https://doi.org/10.2139/ssrn.4170486
2022-01-01
SSRN Electronic Journal
Abstract:Arbitrary data on a computer having external network connectivity can be sent to the attacker by making Domain Name System ( DNS ) queries. This technique is known as DNS tunneling. DNS tunneling detection can be seen as a classification task in the view of machine learning. However, the traditional classifiers cannot accurately predict the classes of queries whose patterns do not appear during training and these queries are inevitable in the live network. For convenience, we call them unknown queries. This paper first discussed the risks of DNS tunneling and the difficulties faced by DNS tunneling detection. Then we described DNS tunneling detection as a machine learning problem, i.e., Classification with New Patterns Emerging ( CNPE ), and presented its definition. Finally, we proposed a forest-based classifier to tackle this problem. Our model can identify unknown samples as a new class as well as accurately classify samples that are not unknown. The computational overhead of our model is near-linear with respect to the number of samples, and it is memory-friendly. Experiments are conducted on queries obtained from the live network and on public datasets . Experimental results show the effectiveness of the proposed algorithm.
What problem does this paper attempt to address?