DoLLM: How Large Language Models Understanding Network Flow Data to Detect Carpet Bombing DDoS

Qingyang Li,Yihang Zhang,Zhidong Jia,Yannan Hu,Lei Zhang,Jianrong Zhang,Yongming Xu,Yong Cui,Zongming Guo,Xinggong Zhang
2024-01-01
Abstract:It is an interesting question Can and How Large Language Models (LLMs)understand non-language network data, and help us detect unknown maliciousflows. This paper takes Carpet Bombing as a case study and shows how to exploitLLMs' powerful capability in the networking area. Carpet Bombing is a new DDoSattack that has dramatically increased in recent years, significantlythreatening network infrastructures. It targets multiple victim IPs withinsubnets, causing congestion on access links and disrupting network services fora vast number of users. Characterized by low-rates, multi-vectors, theseattacks challenge traditional DDoS defenses. We propose DoLLM, a DDoS detectionmodel utilizes open-source LLMs as backbone. By reorganizing non-contextualnetwork flows into Flow-Sequences and projecting them into LLMs semantic spaceas token embeddings, DoLLM leverages LLMs' contextual understanding to extractflow representations in overall network context. The representations are usedto improve the DDoS detection performance. We evaluate DoLLM with publicdatasets CIC-DDoS2019 and real NetFlow trace from Top-3 countrywide ISP. Thetests have proven that DoLLM possesses strong detection capabilities. Its F1score increased by up to 33.3real ISP traces.
What problem does this paper attempt to address?