Abstract: Intelligent reflecting surface (IRS)-assisted mobile edge computing (MEC) systems have shown notable efficiency gains, including reduced latency, higher data rates, and better energy efficiency. However, resource competition among users can lead to uneven allocation, increased latency, and lower throughput. The rate-splitting multiple access (RSMA) technique has emerged as a promising solution for managing interference and optimizing resource allocation in MEC systems. This paper studies an IRS-assisted MEC system with RSMA, aiming to jointly optimize the passive beamforming of the IRS, the active beamforming of the base station, the task offloading allocation, the transmit power of users, the ratios of public and private information allocation, and the decoding order of the RSMA to minimize the average delay from a novel uplink transmission perspective. Since the formulated problem is non-convex and the optimization variables are highly coupled, we propose a hierarchical deep reinforcement learning-based algorithm to optimize both the continuous and the discrete variables of the problem. Additionally, to better extract channel features, we design a novel network architecture within the policy and evaluation networks of the proposed algorithm, combining convolutional neural networks and densely connected convolutional networks for feature extraction. Simulation results indicate that the proposed algorithm not only exhibits excellent convergence performance but also outperforms various benchmarks.

What problem does this paper attempt to address?

This paper asks how rate-splitting multiple access (RSMA) can be introduced into an intelligent reflecting surface (IRS)-assisted mobile edge computing (MEC) system to optimize resource allocation and manage interference, thereby minimizing the average delay of the system. Specifically, the research jointly optimizes the following aspects:

1. **Passive beamforming of IRS**: Adjust the phase and amplitude of the IRS reflection units to optimize the signal transmission quality.
2. **Active beamforming of the base station**: Optimize the direction and power of the base station's transmitted signal to improve communication efficiency.
3. **Task offloading allocation of users**: Decide which tasks are processed locally by users and which tasks are offloaded to the MEC server for processing.
4. **Transmit power of users**: Allocate the transmit power of users to balance communication quality against energy consumption.
5. **Allocation ratio of public and private information in RSMA**: Control the ratio of public to private information to manage interference effectively.
6. **Decoding order of RSMA**: Determine the decoding order of different users' information to maximize the system throughput.

Because the formulated optimization problem is non-convex and its optimization variables are highly coupled, it is difficult to solve with traditional optimization methods. The authors therefore propose an algorithm based on Hierarchical Deep Reinforcement Learning (HDRL) that optimizes the continuous and discrete variables simultaneously. In addition, to better extract channel features, they design a novel network architecture that combines a Convolutional Neural Network (CNN) with a Dense Convolutional Network (DenseNet).
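
To make the mix of continuous and discrete variables concrete, here is a minimal Python sketch of one joint decision. It is not the authors' HDRL algorithm: it only draws a random feasible point. The function name `sample_decision`, the dictionary keys, the power cap `p_max`, the unit-norm beamformers, and the treatment of offloading as a fraction in [0, 1] are all assumptions made for illustration.

```python
import numpy as np


def sample_decision(num_users, num_irs_elements, num_antennas, p_max, rng):
    """Draw one random feasible joint decision (hypothetical layout)."""
    # Active beamformers: one complex vector per user, normalized to unit norm
    # (the normalization is an assumption, not taken from the paper).
    shape = (num_users, num_antennas)
    w = rng.standard_normal(shape) + 1j * rng.standard_normal(shape)
    w /= np.linalg.norm(w, axis=1, keepdims=True)

    return {
        # Continuous variables
        "irs_amplitude": rng.uniform(0.0, 1.0, num_irs_elements),
        "irs_phase": rng.uniform(0.0, 2.0 * np.pi, num_irs_elements),
        "bs_beamformers": w,
        "offload_fraction": rng.uniform(0.0, 1.0, num_users),  # assumed partial offloading
        "tx_power": rng.uniform(0.0, p_max, num_users),
        "public_ratio": rng.uniform(0.0, 1.0, num_users),
        # Discrete variable: the RSMA decoding order as a permutation of users
        "decoding_order": rng.permutation(num_users),
    }
```

A learning agent would replace the random draws with policy outputs; the point of the sketch is only that the decoding order is a permutation, while every other entry lives in a continuous range.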

### Summary of mathematical formulas

- **Composite channel gain**:
  \[ H_n(t)=G(t)\Theta(t)h_{irs}^{n}(t)+h_{dir}^{n}(t) \]
  where \(\Theta(t)=\text{diag}(\alpha_1 e^{j\theta_1},\alpha_2 e^{j\theta_2},\ldots,\alpha_K e^{j\theta_K})\), \(\alpha_k\in[0, 1]\), \(\theta_k\in[0, 2\pi)\).
- **Received signal**:
  \[ y(t)=\sum_{n = 1}^N H_n(t)x_n(t)+n \]
  where \(n\) is the Additive White Gaussian Noise (AWGN).
- **Transmit power of public and private information**:
  \[ p_{n,c}(t)=\gamma_n(t)p_n(t),\quad p_{n,p}(t)=(1 - \gamma_n(t))p_n(t) \]
- **SINR for decoding public and private information**:
  \[ \rho_{n,c}(t)=\frac{|w_n^H(t)H_n(t)|^2\gamma_n(t)p_n(t)}{\sum_{l,n\in N|\pi_l(t)>\pi_n(t)}|w_n^H(t)H_l(t)|^2\gamma_l(t)p_l(t)+\sum_{l = 1}^N|w_n^H(t)H_l(t)|^2(1-\gamma_l(t))p_l(t)+\|w_n^H(t)\|^2\sigma^2} \]
  \[ \rho_{n,p}(t)=\frac{|w_n^H(t)H_n(t)|^2(1-\gamma_n(t))p_n(t)}{\sum_{l\neq n}|w_n^H(t)H_l(t)|^2(1-\gamma_l(t))p_
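
The composite channel, the public/private power split, and the public-stream SINR \(\rho_{n,c}\) can be evaluated numerically as in the following NumPy sketch. It is a hedged illustration rather than the authors' code: the function names, the array shapes (users stacked along the first axis, \(M\) base-station antennas, \(K\) IRS elements), and the encoding of the decoding order \(\pi_n\) as an integer rank per user are assumptions. Only \(\rho_{n,c}\) is implemented, since that is the SINR expression given in full.

```python
import numpy as np


def composite_channel(G, alpha, theta, h_irs, h_dir):
    """H_n = G diag(alpha * exp(j*theta)) h_irs[n] + h_dir[n] for every user n.

    G: (M, K), alpha/theta: (K,), h_irs: (N, K), h_dir: (N, M) -> returns (N, M).
    """
    reflect = alpha * np.exp(1j * theta)  # diagonal entries of Theta
    return (h_irs * reflect) @ G.T + h_dir


def split_power(p, gamma):
    """Return (public power, private power) = (gamma * p, (1 - gamma) * p)."""
    return gamma * p, (1.0 - gamma) * p


def public_sinr(W, H, p, gamma, order, sigma2):
    """SINR of each user's public stream, following rho_{n,c} above.

    W: (N, M) beamformers w_n as rows, H: (N, M) composite channels,
    p, gamma: (N,), order: (N,) integer ranks pi_n, sigma2: noise power.
    """
    p_c, p_p = split_power(p, gamma)
    gains = np.abs(W.conj() @ H.T) ** 2          # gains[n, l] = |w_n^H H_l|^2
    later = order[None, :] > order[:, None]      # later[n, l] is True when pi_l > pi_n
    interference_public = (gains * later) @ p_c  # public streams with pi_l > pi_n
    interference_private = gains @ p_p           # all private streams, l = 1..N
    noise = np.sum(np.abs(W) ** 2, axis=1) * sigma2
    signal = np.diag(gains) * p_c
    return signal / (interference_public + interference_private + noise)
```

Keeping `gains` as a full \(N \times N\) matrix lets both interference sums be written as matrix-vector products, with the decoding order entering only through the boolean mask `later`.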

Hierarchical Learning for IRS-Assisted MEC Systems with Rate-Splitting Multiple Access