DPD-NeuralEngine: A 22-nm 6.6-TOPS/W/mm$^2$ Recurrent Neural Network Accelerator for Wideband Power Amplifier Digital Pre-Distortion

Ang Li, Haolin Wu, Yizhuo Wu, Qinyu Chen, Leo C. N. de Vreede, Chang Gao
2024-10-16
Abstract: The increasing adoption of Deep Neural Network (DNN)-based Digital Pre-distortion (DPD) in modern communication systems necessitates efficient hardware implementations. This paper presents DPD-NeuralEngine, an ultra-fast, tiny-area, and power-efficient DPD accelerator based on a Gated Recurrent Unit (GRU) neural network (NN). Leveraging a co-designed software and hardware approach, our 22 nm CMOS implementation operates at 2 GHz, capable of processing I/Q signals up to 250 MSps. Experimental results demonstrate a throughput of 256.5 GOPS and power efficiency of 1.32 TOPS/W with DPD linearization performance measured in Adjacent Channel Power Ratio (ACPR) of -45.3 dBc and Error Vector Magnitude (EVM) of -39.8 dB. To our knowledge, this work represents the first AI-based DPD application-specific integrated circuit (ASIC) accelerator, achieving a power-area efficiency (PAE) of 6.6 TOPS/W/mm$^2$.
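The headline figures in the abstract can be cross-checked with a few lines of arithmetic. The sketch below uses only the numbers quoted above. The derived quantities (cycles per sample, operations per sample, implied power, implied area, EVM as a percentage) are back-of-the-envelope values and not figures reported in the abstract. The EVM conversion assumes the common 20·log10 amplitude-ratio convention.

```python
# Figures quoted in the abstract.
CLOCK_HZ = 2e9                  # 2 GHz operating frequency
SAMPLE_RATE_SPS = 250e6         # 250 MSps I/Q sample rate
THROUGHPUT_OPS = 256.5e9        # 256.5 GOPS
EFFICIENCY_OPS_PER_W = 1.32e12  # 1.32 TOPS/W
PAE_TOPS_PER_W_MM2 = 6.6        # 6.6 TOPS/W/mm^2
EVM_DB = -39.8                  # EVM in dB

# Clock cycles available to process one I/Q sample.
cycles_per_sample = CLOCK_HZ / SAMPLE_RATE_SPS

# Operations performed per I/Q sample at full throughput.
ops_per_sample = THROUGHPUT_OPS / SAMPLE_RATE_SPS

# Power implied by throughput divided by power efficiency (derived, in watts).
implied_power_w = THROUGHPUT_OPS / EFFICIENCY_OPS_PER_W

# Area implied by power efficiency divided by power-area efficiency (derived, in mm^2).
implied_area_mm2 = (EFFICIENCY_OPS_PER_W / 1e12) / PAE_TOPS_PER_W_MM2

# EVM as a percentage, assuming EVM_dB = 20 * log10(EVM_ratio).
evm_percent = 100 * 10 ** (EVM_DB / 20)

print(f"cycles per sample : {cycles_per_sample:.0f}")
print(f"ops per sample    : {ops_per_sample:.0f}")
print(f"implied power     : {implied_power_w * 1e3:.1f} mW")
print(f"implied area      : {implied_area_mm2:.2f} mm^2")
print(f"EVM               : {evm_percent:.2f} %")
```

Under these assumptions the accelerator has 8 clock cycles per sample, performs about 1026 operations per sample, and the efficiency figures imply roughly 194 mW of power and about 0.2 mm² of area.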
Hardware Architecture, Artificial Intelligence, Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The paper addresses the growing need for efficient hardware implementations of digital pre-distortion (DPD) in modern communication systems, especially 5G and future 6G systems. As wireless systems evolve towards higher data rates and wider bandwidths, DPD algorithms in the digital baseband back-end (DBE) have become crucial. However, traditional DPD techniques such as the generalized memory polynomial (GMP) model struggle to meet the linearization requirements of wideband power amplifiers (PAs), especially under strict frequency and latency constraints.

To solve these problems, the paper proposes an application-specific integrated circuit (ASIC) accelerator named DPD-NeuralEngine. The accelerator is based on a gated recurrent unit (GRU) recurrent neural network (RNN), is designed to process I/Q signals at up to 250 MSps, and pursues the following key objectives:

1. **Efficient Hardware Implementation**: A software-hardware co-design method yields an ultra-fast, small-area, and energy-efficient DPD accelerator.
2. **High Throughput and Low Power Consumption**: Experimental results show a throughput of 256.5 GOPS and a power efficiency of 1.32 TOPS/W.
3. **Excellent Linearization Performance**: The accelerator achieves an adjacent channel power ratio (ACPR) of -45.3 dBc and an error vector magnitude (EVM) of -39.8 dB.
4. **The First AI-based DPD ASIC Accelerator**: To the best of the authors' knowledge, this is the first AI-based DPD ASIC accelerator, achieving a power-area efficiency (PAE) of 6.6 TOPS/W/mm².

In summary, the paper combines efficient hardware design with a neural network DPD algorithm to meet the high-performance and low-power requirements that DPD faces in modern communication systems.
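To make the idea of a GRU-based predistorter concrete, the following is a minimal NumPy sketch of a single GRU layer followed by a linear output layer, applied sample by sample to an I/Q sequence. It is an illustration only and not the authors' architecture. The hidden size, the use of raw I and Q as the only input features, the random weights, the floating-point arithmetic, and the names `init_params` and `gru_dpd_forward` are all assumptions made for this example. The gate equations follow the widely used GRU formulation with reset gate `r`, update gate `z`, and candidate state `n`.

```python
import numpy as np


def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))


def init_params(hidden=8, n_in=2, n_out=2, seed=0):
    """Random weights for illustration; a real DPD model would be trained."""
    rng = np.random.default_rng(seed)
    scale = 0.1
    return {
        "W_x": scale * rng.standard_normal((3 * hidden, n_in)),    # input weights for r, z, n
        "W_h": scale * rng.standard_normal((3 * hidden, hidden)),  # recurrent weights for r, z, n
        "b_x": np.zeros(3 * hidden),
        "b_h": np.zeros(3 * hidden),
        "W_o": scale * rng.standard_normal((n_out, hidden)),       # linear output layer
        "b_o": np.zeros(n_out),
    }


def gru_dpd_forward(iq, params):
    """Map an I/Q sequence of shape (T, 2) to a predistorted sequence of shape (T, 2)."""
    hidden = params["W_h"].shape[1]
    h = np.zeros(hidden)                     # hidden state carries the memory of past samples
    out = np.empty(iq.shape, dtype=float)
    for t, x in enumerate(iq):
        gi = params["W_x"] @ x + params["b_x"]
        gh = params["W_h"] @ h + params["b_h"]
        i_r, i_z, i_n = np.split(gi, 3)
        h_r, h_z, h_n = np.split(gh, 3)
        r = sigmoid(i_r + h_r)               # reset gate
        z = sigmoid(i_z + h_z)               # update gate
        n = np.tanh(i_n + r * h_n)           # candidate state
        h = (1.0 - z) * n + z * h            # blend candidate and previous state
        out[t] = params["W_o"] @ h + params["b_o"]
    return out


# Usage: run a short random I/Q burst through the untrained model.
params = init_params()
iq_in = np.random.default_rng(1).standard_normal((16, 2))
iq_out = gru_dpd_forward(iq_in, params)
print(iq_out.shape)
```

The loop processes one sample per iteration because each hidden state depends on the previous one. This sequential dependence is what makes the per-sample latency budget relevant for a hardware implementation running at a fixed sample rate.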