Abstract: This paper introduces hybrid automatic repeat request with incremental redundancy (HARQ-IR) to boost the reliability of short packet communications. Finite blocklength information theory and correlated decoding events make the analysis of the average block error rate (BLER) considerably harder. Fortunately, the recursive form of the average BLER motivates us to calculate its value through the trapezoidal approximation and Gauss-Laguerre quadrature. Moreover, an asymptotic analysis is performed to derive a simple expression for the average BLER at high signal-to-noise ratio (SNR). We then study the maximization of the long term average throughput (LTAT) via power allocation while satisfying the power and BLER constraints. For tractability, the asymptotic BLER is employed to solve the problem through geometric programming (GP). However, the GP-based solution underestimates the LTAT at low SNR because the approximation error is large in this regime. Alternatively, we develop a deep reinforcement learning (DRL)-based framework to learn the power allocation policy. In particular, the optimization problem is transformed into a constrained Markov decision process, which is solved by integrating deep deterministic policy gradient (DDPG) with the subgradient method. The numerical results demonstrate that the DRL-based method outperforms the GP-based one at low SNR, albeit at the cost of an increased computational burden.
What problem does this paper attempt to address?
### Problems the paper attempts to solve
This paper aims to improve the reliability of short-packet communication by introducing Hybrid Automatic Repeat reQuest with Incremental Redundancy (HARQ-IR). Specifically, it focuses on the following aspects:
1. **Analysis of Average Block Error Rate (BLER)**:
- Finite-blocklength information theory and the correlation between decoding events across HARQ rounds make the average BLER hard to analyze, and classical Shannon capacity results no longer apply to short packets. The paper therefore exploits the recursive form of the average BLER and evaluates it numerically with the trapezoidal approximation and Gauss-Laguerre quadrature.
- The paper also carries out an asymptotic analysis at high Signal-to-Noise Ratio (SNR) and derives a simplified expression for the average BLER.
2. **Maximization of Long-Term Average Throughput (LTAT)**:
- The paper studies how to maximize the LTAT through power allocation subject to power and BLER constraints.
- For tractability, the paper substitutes the asymptotic BLER expression into the problem and solves it by Geometric Programming (GP). However, the GP-based solution underestimates the LTAT at low SNR, where the asymptotic approximation error is large.
- As an alternative, the paper proposes a Deep Reinforcement Learning (DRL) framework to learn the power allocation policy. Specifically, the optimization problem is transformed into a constrained Markov Decision Process (MDP) and solved by combining Deep Deterministic Policy Gradient (DDPG) with the subgradient method.
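To make the quadrature step concrete, the following is a minimal sketch of averaging a conditional BLER over fading with Gauss-Laguerre quadrature. It rests on several assumptions that are not taken from the paper: a single transmission rather than the full HARQ-IR recursion, Rayleigh fading (so the channel power gain is exponentially distributed with unit mean), and the widely used normal approximation from finite-blocklength theory as the conditional BLER. The function names and the parameter values are hypothetical.

```python
import math
import numpy as np


def q_function(x):
    """Gaussian Q-function, Q(x) = 0.5 * erfc(x / sqrt(2))."""
    return 0.5 * math.erfc(x / math.sqrt(2.0))


def conditional_bler(snr, rate, blocklength):
    """Normal approximation of the BLER for one transmission at a given SNR.

    Assumption: standard finite-blocklength normal approximation, which is
    not necessarily the exact multi-round expression used in the paper.
    """
    capacity = math.log2(1.0 + snr)
    dispersion = (1.0 - (1.0 + snr) ** -2) * math.log2(math.e) ** 2
    dispersion = max(dispersion, 1e-12)  # guard against division by zero
    return q_function((capacity - rate) / math.sqrt(dispersion / blocklength))


def gauss_laguerre_expectation(func, num_nodes=64):
    """Approximate E[func(g)] for g ~ Exp(1), i.e. the integral of
    func(g) * exp(-g) over g >= 0, by a weighted sum over Laguerre nodes."""
    nodes, weights = np.polynomial.laguerre.laggauss(num_nodes)
    return float(sum(w * func(x) for x, w in zip(nodes, weights)))


def average_bler(avg_snr, rate, blocklength, num_nodes=64):
    """Average the conditional BLER over the (assumed) Rayleigh fading gain."""
    return gauss_laguerre_expectation(
        lambda g: conditional_bler(avg_snr * g, rate, blocklength), num_nodes
    )


if __name__ == "__main__":
    # Hypothetical parameters: rate 0.5 bits per channel use, blocklength 200.
    for snr_db in (0, 5, 10):
        avg_snr = 10.0 ** (snr_db / 10.0)
        print(snr_db, average_bler(avg_snr, rate=0.5, blocklength=200))
```

Because the conditional BLER behaves almost like a step function of the channel gain, the accuracy of this sketch depends on the number of nodes, so it is worth checking that the result is stable as `num_nodes` increases.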
### Main contributions
1. **Numerical evaluation of average BLER**:
- Proposes the trapezoidal approximation and Gauss-Laguerre quadrature to numerically evaluate the average BLER, and reduces the complexity of the Gauss-Laguerre quadrature through dynamic programming.
- Conducts an asymptotic analysis under high SNR conditions and obtains a simplified expression for the average BLER.
2. **Power allocation scheme for LTAT maximization**:
- Using the asymptotic results, transforms the optimization problem into a convex problem and solves it by geometric programming.
- To address the performance loss at low SNR, proposes a deep reinforcement learning method that formulates a constrained MDP and solves it by combining DDPG with the subgradient method.
3. **Numerical analysis**:
- Numerical results show that the DRL-based method outperforms the GP-based method at low SNR, at the cost of a higher computational burden in the offline training phase.
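The role of the subgradient method in a constrained formulation can be illustrated with a toy problem. The sketch below is not the paper's algorithm: it replaces the DDPG policy optimization with a closed-form inner maximizer for the hypothetical objective `log(1 + p)` under a power budget `p <= power_budget`, and only shows how a Lagrange multiplier is adjusted by a projected subgradient step until the constraint is met. All names and numbers are assumptions for illustration.

```python
def solve_inner_problem(multiplier):
    """Maximize log(1 + p) - multiplier * p over p >= 0 (closed form).

    Hypothetical stand-in for the policy optimization step; in the paper this
    role is played by DDPG training, not by a closed-form solution.
    """
    return max(0.0, 1.0 / multiplier - 1.0)


def dual_subgradient(power_budget, step_size=0.1, iterations=200, multiplier=1.0):
    """Projected subgradient update of the multiplier for p <= power_budget."""
    power = solve_inner_problem(multiplier)
    for _ in range(iterations):
        power = solve_inner_problem(multiplier)
        # Constraint violation: positive when the budget is exceeded, which
        # raises the multiplier and penalizes power more in the next round.
        violation = power - power_budget
        # Projection keeps the multiplier strictly positive.
        multiplier = max(1e-6, multiplier + step_size * violation)
    return power, multiplier


if __name__ == "__main__":
    power, multiplier = dual_subgradient(power_budget=2.0)
    print(power, multiplier)
```

For this toy objective the multiplier settles where the inner solution exactly uses the budget. In a learning-based setting the inner step is only solved approximately, so the multiplier update is usually interleaved with policy updates rather than run to convergence.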
### Structural overview
- **System model**: Introduces the system model of HARQ-IR-assisted short-packet communication, including the transmission protocol and the reliability metric (BLER).
- **Average BLER analysis**: Derives in detail the calculation method of the average BLER, including numerical evaluation and asymptotic analysis.
- **LTAT maximization**: Studies how to maximize LTAT through power allocation and proposes two methods based on GP and DRL.
- **Numerical analysis**: Verifies the effectiveness of the proposed methods through numerical experiments.
- **Conclusion**: Summarizes the main contributions of the paper and future research directions.
### Formula summary
1. **Received signal model**:
\[
y_m=\sqrt{P_m}h_mx_m + n_m
\]
where \( P_m \) is the transmission power, \( x_m \) is the transmitted sub-codeword, \( n_m \) is the complex Gaussian noise, and \( h_m \) is the channel coefficient.
2. **Conditional BLER**:
\[
\epsilon_M\approx Q\left(\frac{1}{\sqrt{M}}\sum_{m = 1}^M\log_2(1+\bar{\gamma}_m g_m)-R\right)
\]
where \(\bar{\gamma}_m=\frac{P_m}{N_0}\) is the average transmit Signal-to-Noise Ratio (SNR), \( R = \frac{K}{L}\) is the initial transmission rate, and \( Q(x)=\int_x^\infty\frac{1}{\sqrt{2\pi}}e^{-\frac{t^2}{2}}dt\).
3. **Average BLER**: