Abstract: The increasing adoption of Deep Neural Network (DNN)-based Digital Pre-distortion (DPD) in modern communication systems necessitates efficient hardware implementations. This paper presents DPD-NeuralEngine, an ultra-fast, tiny-area, and power-efficient DPD accelerator based on a Gated Recurrent Unit (GRU) neural network (NN). Leveraging a co-designed software and hardware approach, our 22 nm CMOS implementation operates at 2 GHz and can process I/Q signals at up to 250 MSps. Experimental results demonstrate a throughput of 256.5 GOPS and a power efficiency of 1.32 TOPS/W, with DPD linearization performance of -45.3 dBc in Adjacent Channel Power Ratio (ACPR) and -39.8 dB in Error Vector Magnitude (EVM). To our knowledge, this work represents the first AI-based DPD application-specific integrated circuit (ASIC) accelerator, achieving a power-area efficiency (PAE) of 6.6 TOPS/W/mm$^2$.
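
The headline figures in the abstract can be cross-checked against each other with a little arithmetic. The sketch below is not from the paper: it assumes the throughput and power efficiency were measured at the same operating point, and that PAE is simply power efficiency divided by silicon area. The implied power, implied area, cycles per sample, and operations per sample are therefore derived estimates, not reported values.

```python
# Figures quoted in the abstract
throughput_gops = 256.5        # GOPS
efficiency_tops_per_w = 1.32   # TOPS/W
pae_tops_per_w_mm2 = 6.6       # TOPS/W/mm^2
clock_hz = 2e9                 # 2 GHz
sample_rate_sps = 250e6        # 250 MSps

# Derived estimates (assumptions, not stated in the source)
# Power = throughput / efficiency, with throughput converted to TOPS.
implied_power_w = (throughput_gops / 1000.0) / efficiency_tops_per_w
# Area = efficiency / PAE, assuming PAE = (TOPS/W) / mm^2.
implied_area_mm2 = efficiency_tops_per_w / pae_tops_per_w_mm2
# Clock cycles available for each incoming I/Q sample.
cycles_per_sample = clock_hz / sample_rate_sps
# Operations spent per I/Q sample at the quoted throughput.
ops_per_sample = throughput_gops * 1e9 / sample_rate_sps

print(f"implied power : {implied_power_w * 1e3:.1f} mW")
print(f"implied area  : {implied_area_mm2:.2f} mm^2")
print(f"cycles/sample : {cycles_per_sample:.0f}")
print(f"ops/sample    : {ops_per_sample:.0f}")
```

Under these assumptions the numbers are mutually consistent with a design drawing roughly 0.19 W in about 0.2 mm², with 8 clock cycles available per I/Q sample.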

What problem does this paper attempt to address?

The paper addresses the need for efficient hardware implementations of DNN-based digital pre-distortion (DPD) in modern communication systems, especially 5G and future 6G systems. As wireless systems move towards higher data rates and wider bandwidths, DPD in the digital baseband back-end (DBE) has become crucial. However, traditional DPD techniques such as the generalized memory polynomial (GMP) model struggle to meet the linearization requirements of wideband power amplifiers (PAs), especially under strict frequency and latency constraints.

To address this, the paper proposes an application-specific integrated circuit (ASIC) accelerator named DPD-NeuralEngine. The accelerator is based on a gated recurrent unit (GRU), a type of recurrent neural network (RNN), and is designed to process I/Q signals at up to 250 MSps. Its key objectives are:

1. **Efficient Hardware Implementation**: A software-hardware co-design method yields an ultra-fast, small-area, and energy-efficient DPD accelerator.
2. **High Throughput and Low Power Consumption**: Experimental results show a throughput of 256.5 GOPS and a power efficiency of 1.32 TOPS/W.
3. **Excellent Linearization Performance**: The accelerator achieves an adjacent channel power ratio (ACPR) of -45.3 dBc and an error vector magnitude (EVM) of -39.8 dB.
4. **The First AI-based DPD ASIC Accelerator**: To the best of the authors' knowledge, this is the first AI-based DPD ASIC accelerator, achieving a power-area efficiency (PAE) of 6.6 TOPS/W/mm².

In summary, the paper tackles the challenge of running DPD in modern communication systems with both high linearization performance and low power consumption, by combining a neural network model with hardware designed specifically for it.
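
To make the idea of a GRU-based predistorter concrete, the following is a minimal pure-Python sketch of the standard GRU equations applied to a stream of I/Q samples. It is not the paper's architecture: the class name `TinyGruDpd`, the hidden size, the choice of raw I and Q as the only input features, the linear output layer, and the random untrained weights are all assumptions made for illustration.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def matvec(m, v):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * x for w, x in zip(row, v)) for row in m]


def rand_matrix(rows, cols, rng, scale=0.1):
    return [[rng.uniform(-scale, scale) for _ in range(cols)] for _ in range(rows)]


class TinyGruDpd:
    """Hypothetical GRU predistorter: one I/Q sample in, one I/Q sample out."""

    def __init__(self, hidden=8, seed=0):
        rng = random.Random(seed)
        n_in = 2  # assumed input features: I and Q only
        self.hidden = hidden
        # Input (w) and recurrent (u) weights for the update (z),
        # reset (r), and candidate (n) gates; untrained random values.
        self.w = {g: rand_matrix(hidden, n_in, rng) for g in "zrn"}
        self.u = {g: rand_matrix(hidden, hidden, rng) for g in "zrn"}
        self.b = {g: [0.0] * hidden for g in "zrn"}
        self.w_out = rand_matrix(2, hidden, rng)  # linear layer to I/Q

    def _gate(self, g, x, h, act):
        wx = matvec(self.w[g], x)
        uh = matvec(self.u[g], h)
        return [act(a + b + c) for a, b, c in zip(wx, uh, self.b[g])]

    def step(self, x, h):
        """One GRU time step: returns the new hidden state."""
        z = self._gate("z", x, h, sigmoid)            # update gate
        r = self._gate("r", x, h, sigmoid)            # reset gate
        rh = [ri * hi for ri, hi in zip(r, h)]        # reset applied to state
        n = self._gate("n", x, rh, math.tanh)         # candidate state
        return [(1.0 - zi) * ni + zi * hi for zi, ni, hi in zip(z, n, h)]

    def predistort(self, samples):
        """Run a sequence of complex I/Q samples through the GRU."""
        h = [0.0] * self.hidden
        out = []
        for s in samples:
            h = self.step([s.real, s.imag], h)
            i_out, q_out = matvec(self.w_out, h)
            out.append(complex(i_out, q_out))
        return out


model = TinyGruDpd(hidden=8, seed=0)
y = model.predistort([complex(0.1, -0.2), complex(0.3, 0.05), complex(-0.2, 0.4)])
print(len(y))
```

The recurrent state `h` is what lets the model capture PA memory effects: each output depends on the current sample and on a learned summary of earlier ones. A hardware implementation would presumably replace the floating-point arithmetic here with fixed-point operations, but the source summary does not give those details.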

DPD-NeuralEngine: A 22-nm 6.6-TOPS/W/mm$^2$ Recurrent Neural Network Accelerator for Wideband Power Amplifier Digital Pre-Distortion
