Covert Communication in NOMA Systems with Decision-Assisted Q-learning

Jiaqing Bai,Ji He,Xiaohong Jiang
DOI: https://doi.org/10.1109/candarw57323.2022.00056
2022-01-01
Abstract:We consider covert communication in a network with multiple non-orthogonal multiple access (NOMA) systems, where each NOMA system consists of an Alice, a user with the reliability requirement (URR), a user with the covertness requirement (UCR), and a Willie. We first provide theoretical analysis for both covert rate and reliable rate, and then formulate the sum-rate maximization in the network as a non-convex optimization problem. To solve the highly complex optimization problem, we then resort to the reinforcement learning (RL) technique and develop a distributed stateless Q-learning algorithm to identify the optimal covert power allocation for sum-rate maximization. Finally, numerical results are provided to demonstrate the efficiency of using stateless Q-learning in sum-rate maximization.
What problem does this paper attempt to address?