Joint Optimization of Age of Information and Energy Consumption in NR-V2X System based on Deep Reinforcement Learning

Shulin Song,Zheng Zhang,Qiong Wu,Qiang Fan,Pingyi Fan
2024-07-11
Abstract:Autonomous driving may be the most important application scenario of next generation, the development of wireless access technologies enabling reliable and low-latency vehicle communication becomes crucial. To address this, 3GPP has developed Vehicle-to-Everything (V2X) specifications based on 5G New Radio (NR) technology, where Mode 2 Side-Link (SL) communication resembles Mode 4 in LTE-V2X, allowing direct communication between vehicles. This supplements SL communication in LTE-V2X and represents the latest advancement in cellular V2X (C-V2X) with improved performance of NR-V2X. However, in NR-V2X Mode 2, resource collisions still occur, and thus degrade the age of information (AOI). Therefore, a interference cancellation method is employed to mitigate this impact by combining NR-V2X with Non-Orthogonal multiple access (NOMA) technology. In NR-V2X, when vehicles select smaller resource reservation interval (RRI), higher-frequency transmissions take ore energy to reduce AoI. Hence, it is important to jointly consider AoI and communication energy consumption based on NR-V2X communication. Then, we formulate such an optimization problem and employ the Deep Reinforcement Learning (DRL) algorithm to compute the optimal transmission RRI and transmission power for each transmitting vehicle to reduce the energy consumption of each transmitting vehicle and the AoI of each receiving vehicle. Extensive simulations have demonstrated the performance of our proposed algorithm.
Machine Learning,Networking and Internet Architecture,Signal Processing
What problem does this paper attempt to address?
The problem that this paper attempts to solve is how to jointly optimize the freshness of information (Age of Information, AoI) and energy consumption in the Internet of Vehicles based on the 5G NR - V2X (New Radio Vehicle - to - Everything) system. Specifically: 1. **Resource Collision Problem**: In NR - V2X mode 2, due to the existence of the autonomous resource allocation method, resource collisions may occur between vehicles, thereby degrading the performance of AoI. 2. **Trade - off between Energy Consumption and AoI**: When a vehicle selects a smaller resource reservation interval (RRI), although it can transmit more frequently to reduce AoI, this will increase energy consumption. Therefore, a balance point needs to be found between AoI and energy consumption. To solve these problems, the authors propose the following methods: - **Introducing Non - Orthogonal Multiple Access (NOMA) Technology**: Use NOMA technology to alleviate the impact of resource collisions on AoI. NOMA allows multiple users to communicate simultaneously in the same frequency band, thereby reducing interference and improving SINR (Signal - to - Interference - plus - Noise Ratio). - **Optimization Algorithm Based on Deep Reinforcement Learning (DRL)**: Construct an optimization problem and use a deep reinforcement learning algorithm to calculate the optimal RRI and transmission power for each transmitting vehicle, in order to reduce the energy consumption of each transmitting vehicle and the AoI of the receiving vehicle. Finally, the effectiveness of the proposed algorithm is verified through extensive simulations, proving that this method can reduce energy consumption during the communication process while ensuring a low AoI. ### Summary of Mathematical Formulas - **SINR Expression**: \[ \eta_{i \rightarrow j}^t=\frac{h_t^s h_{i \rightarrow j}^t p_i^t}{L_d(d_{i \rightarrow j}) I_{i \rightarrow j}^t + p_n} \] where: - \(h_t^s\) is the random small - scale fading gain. - \(h_{i \rightarrow j}^t\) is the large - scale fading gain. - \(p_i^t\) is the transmission power of vehicle \(i\). - \(L_d(d_{i \rightarrow j})\) is the path loss. - \(I_{i \rightarrow j}^t\) is the interference power. - \(p_n\) is the noise power. - **Interference Power Expression**: \[ I_{i \rightarrow j}^t=\sum_{k \in V_t, k \neq i} \sigma_{k,i}^t h_t^s h_{k \rightarrow j}^t \frac{p_k^t}{L_d(d_{k \rightarrow j})} \] - **Successful Communication Condition**: \[ u_{i \rightarrow j}^t=\left\lfloor\frac{W_t^i \log_2(1 + \eta_{i \rightarrow j}^t)}{G}\right\rfloor \] - **AoI Change**: \[ \Phi_{i \rightarrow j}^{t+\Gamma}= \begin{cases} \phi_{t,1}^n+\Gamma & \text{if } u_{i \rightarrow j}^t = 1\\ \Phi_{i \rightarrow j}^t+\Gamma & \text{if } u_{i \rightarrow j}^t = 0 \end{cases} \] - **Optimization Objective**: \[ \min_{\Gamma_t, p_t}\left[\omega_1 E+\omega_2 \Phi\right]