T-PRIME: Transformer-based Protocol Identification for Machine-learning at the Edge

Mauro Belgiovine,Joshua Groen,Miquel Sirera,Chinenye Tassie,Ayberk Yarkın Yıldız,Sage Trudeau,Stratis Ioannidis,Kaushik Chowdhury
2024-03-05
Abstract:Spectrum sharing allows different protocols of the same standard (e.g., 802.11 family) or different standards (e.g., LTE and DVB) to coexist in overlapping frequency bands. As this paradigm continues to spread, wireless systems must also evolve to identify active transmitters and unauthorized waveforms in real time under intentional distortion of preambles, extremely low signal-to-noise ratios and challenging channel conditions. We overcome limitations of correlation-based preamble matching methods in such conditions through the design of T-PRIME: a Transformer-based machine learning approach. T-PRIME learns the structural design of transmitted frames through its attention mechanism, looking at sequence patterns that go beyond the preamble alone. The paper makes three contributions: First, it compares Transformer models and demonstrates their superiority over traditional methods and state-of-the-art neural networks. Second, it rigorously analyzes T-PRIME's real-time feasibility on DeepWave's AIR-T platform. Third, it utilizes an extensive 66 GB dataset of over-the-air (OTA) WiFi transmissions for training, which is released along with the code for community use. Results reveal nearly perfect (i.e. $>98\%$) classification accuracy under simulated scenarios, showing $100\%$ detection improvement over legacy methods in low SNR ranges, $97\%$ classification accuracy for OTA single-protocol transmissions and up to $75\%$ double-protocol classification accuracy in interference scenarios.
Machine Learning,Networking and Internet Architecture,Systems and Control
What problem does this paper attempt to address?
The main problem this paper attempts to address is the issue of protocol classification and identification in a wireless spectrum sharing environment. Specifically, with the increasing coexistence of different communication protocols within the same frequency bandwidth, how to accurately identify active transmission protocols and unauthorized waveforms in real-time under conditions of interference, low signal-to-noise ratio (SNR), and complex channel conditions has become a significant challenge. ### Background and Problem 1. **Scarcity of Spectrum Resources**: With the increasing demand for wireless services, spectrum resources have become scarce, leading to the coexistence of various communication protocols within the same frequency bandwidth, making the wireless spectrum environment crowded. 2. **Unauthorized Transmissions**: Unauthorized transmissions further increase security risks, posing threats to critical operations. 3. **Limitations of Traditional Methods**: - **Need for Prior Knowledge**: Traditional protocol classification methods require detailed knowledge of the protocols, making it difficult to detect new protocols. - **Performance Degradation at Low SNR**: Even with complete knowledge of the protocols, correlation-based methods struggle to accurately detect under low SNR conditions. - **Multi-Protocol Overlap**: When multiple protocols overlap within the same frequency bandwidth, the detection accuracy of traditional methods significantly decreases. ### Solution To overcome these challenges, the paper proposes **T-PRIME** (Transformer-based Protocol Recognition for Edge Computing). The main features of T-PRIME include: 1. **Transformer Architecture**: Utilizing the attention mechanism of the Transformer model to learn the structural design of transmission frames, not just the preamble. 2. **Real-Time Feasibility**: Conducted rigorous real-time feasibility analysis on the DeepWave AIR-T platform. 3. **Large-Scale Dataset**: Trained using a 66GB over-the-air (OTA) WiFi signal dataset, with the code and dataset made publicly available for community use. ### Main Contributions 1. **Model Comparison**: Experimentally compared the Transformer model with other traditional methods and state-of-the-art neural networks, demonstrating the superiority of the Transformer model. 2. **Real-Time Performance Evaluation**: Implemented T-PRIME on the AIR-T platform and evaluated its performance in real-world scenarios. 3. **Dataset Release**: Released the first OTA WiFi signal dataset containing multiple protocol versions, covering different transmission environments and interference conditions. ### Experimental Results - In simulated scenarios, T-PRIME achieved a 100% detection improvement in low SNR ranges, with a classification accuracy of 97% for single-protocol OTA transmissions and up to 75% accuracy for dual-protocol classification in interference scenarios. ### Conclusion By leveraging the attention mechanism of the Transformer model, T-PRIME successfully addresses the problem of real-time accurate identification of wireless protocols under low SNR and complex channel conditions, providing a new solution for enhancing the performance of wireless systems in spectrum sharing environments.