Decoding of Polar Codes Based on Q-Learning-Driven Belief Propagation

L. M. Oliveira,R. M. Oliveira,R. C. de Lamare
DOI: https://doi.org/10.48550/arXiv.2111.00673
2021-11-01
Abstract:This paper presents an enhanced belief propagation (BP) decoding algorithm and a reinforcement learning-based BP decoding algorithm for polar codes. The enhanced BP algorithm weighs each Processing Element (PE) input based on their signals and Euclidean distances using a heuristic metric. The proposed reinforcement learning-based BP decoding strategy relies on reweighting the messages and consists of two steps: we first weight each PE input based on their signals and Euclidean distances using a heuristic metric, then a Q-learning algorithm (QLBP) is employed to figure out the best correction factor for successful decoding. Simulations show that the proposed enhanced BP and QLBP decoders outperform the successive cancellation (SC) and belief propagation (BP) decoders, and approach the SCL decoders.
Information Theory
What problem does this paper attempt to address?
The paper attempts to address the problem of improving the decoding performance of Polar Codes. Specifically, the authors propose an enhanced Belief Propagation (BP) decoding algorithm and a Q-learning-based Belief Propagation (QLBP) decoding algorithm to improve the performance of traditional decoding methods. ### Background - **Polar Codes**: Polar Codes are a significant breakthrough in coding theory proposed by Arikan in 2009. They can achieve channel capacity and are widely used in 5G communication standards. - **Traditional Decoders**: - **Successive Cancellation (SC) Decoder**: Although it has low complexity, its performance is poor, especially in high real-time applications. - **Successive Cancellation List (SCL) Decoder**: It improves error correction performance by storing the most likely codeword list, but it has high computational complexity. - **Belief Propagation (BP) Decoder**: It has the advantages of parallel processing, high throughput, and low latency, but requires many iterations to achieve good performance. ### Proposed Methods 1. **Enhanced BP Decoding Algorithm**: - Introduces a weighting technique based on Euclidean distance and signal to weight the input of each Processing Element (PE). - This weighting technique aims to optimize the message passing process, thereby improving decoding performance. 2. **Q-learning-based BP Decoding Algorithm (QLBP)**: - Further introduces the Q-learning algorithm on the basis of the enhanced BP algorithm to dynamically adjust the weighting factors. - The Q-learning algorithm learns the optimal weighting factors through a reward mechanism to ensure optimized decoding performance. ### Experimental Results - **Simulation Results Show**: The proposed enhanced BP and QLBP decoders outperform traditional SC and BP decoders in terms of Bit Error Rate (BER) and Frame Error Rate (FER), and are close to the performance of the SCL decoder. - **Applicability**: The QLBP algorithm performs particularly well under low Signal-to-Noise Ratio (SNR) conditions, making it suitable for decoding tasks in low SNR environments. ### Conclusion - The paper significantly improves the decoding performance of Polar Codes, especially in low SNR environments, by introducing an enhanced BP decoding algorithm and a Q-learning-based BP decoding algorithm. - Future work will consider directions such as puncturing techniques and multi-antenna systems to further optimize decoding performance.