Learning The Likelihood Test With One-Class Classifiers for Physical Layer Authentication

Francesco Ardizzon,Stefano Tomasin
2024-08-04
Abstract:In physical layer authentication (PLA) mechanisms, a verifier decides whether a received message has been transmitted by a legitimate user or an intruder, according to some features of the physical channel over which the message traveled. To design the authentication check implemented at the verifier, typically either the statistics or a dataset of features are available for the channel from the legitimate user, while no information is available when under attack. When the statistics are known, a well-known good solution is the likelihood test (LT). When a dataset is available, the decision problem is one-class classification (OCC) and a good understanding of the machine learning (ML) techniques used for its solution is important to ensure security. Thus, in this paper, we aim at obtaining ML PLA verifiers that operate as the LT. We show how to do it with the neural network (NN) and the one-class least-squares support vector machine (OCLSSVM) models, trained as two-class classifiers on the single-class dataset and an artificial dataset. The artificial dataset for the negative class is obtained by generating channel feature (CF) vectors uniformly distributed over the domain of the legitimate class dataset. We also derive a modified stochastic gradient descent (SGD) algorithm that trains a PLA verifier operating as LT without the need for the artificial dataset. Furthermore, we show that the one-class least-squares support vector machine with suitable kernels operates as the LT at convergence. Lastly, we show that the widely used autoencoder classifier generally does not provide the LT. Numerical results are provided considering PLA on both wireless and underwater acoustic channels.
Machine Learning,Signal Processing
What problem does this paper attempt to address?
### What problem does this paper attempt to solve? This paper aims to solve an important problem in Physical Layer Authentication (PLA): how to design a machine - learning model that can work like the Likelihood Ratio Test (LT) when only having the channel characteristic data of legitimate users. Specifically: 1. **Background and Challenges**: - In the PLA mechanism, the verifier needs to judge whether the message is from a legitimate user or an intruder according to the physical channel characteristics of the received message. - When the verifier only knows the statistical information or data set of the channel characteristics of legitimate users and has no information about attackers, the traditional multi - class classification methods cannot be directly applied. - In this case, the One - Class Classification (OCC) method is usually adopted, but the existing OCC methods are not necessarily equivalent to the optimal statistical test method - the Likelihood Ratio Test (LT). 2. **Objectives**: - Researchers hope to develop a machine - learning model that can be trained to behave like LT when only having legitimate user data. - Specifically, they use Neural Network (NN) and Least - Squares Support Vector Machine (LS - SVM) as classifiers and achieve this goal by generating artificial data sets or modifying the gradient descent algorithm. 3. **Main Contributions**: - Two methods are proposed to make machine - learning models (NN and LS - SVM) work like LT after training: - Method 1: Train the model by generating a uniformly distributed artificial negative - class data set. - Method 2: Propose a Modified Stochastic Gradient Descent (MSGD) algorithm without explicitly generating artificial data sets. - It is proved that these methods can converge to LT under appropriate conditions. - It is further shown that Autoencoder (AE) generally cannot provide LT behavior. 4. **Application Scenarios**: - The paper verifies the effectiveness of the proposed methods through experimental results in two scenarios: wireless communication and underwater acoustic communication. In summary, this paper is committed to bridging the gap between the statistical framework and the machine - learning framework, so that the machine - learning model can achieve statistically optimal authentication performance when only having legitimate user data.