Abstract:In areas of signal processing and communications such as antenna array beamforming, adaptive filtering, multi-user and multiple-input multiple-output (MIMO) detection, channel estimation and equalization, echo and interference cancellation and others, solving linear systems of equations often provides an optimal performance. However, this is also a very complicated operation that designers try to avoid by proposing different sub-optimal solutions. The dichotomous coordinate descent (DCD) algorithm allows linear systems of equations to be solved with high computational efficiency. It is a multiplication-free and division-free technique and, therefore, it is well suited for hardware implementation. In this thesis, we present architectures and field-programmable gate array (FPGA) implementations of two variants of the DCD algorithm, known as the cyclic and leading DCD algorithms, for real-valued and complex-valued systems. For each of these techniques, we present architectures and implementations with different degree of parallelism. The proposed architectures allow a trade-off between FPGA resources and the computation time. The fixed-point implementations provide an accuracy performance which is very close to the performance of floating-point counterparts. We also show applications of the designs to complex division, antenna array beamforming and adaptive filtering. The DCD-based complex divider is based on the idea that the complex division can be viewed as a problem of finding the solution of a 2x2 real-valued system of linear equations, which is solved using the DCD algorithm. Therefore, the new divider uses no multiplication and division. Comparing with the classical complex divider, the DCD-based complex divider requires significantly smaller chip area. A DCD-based minimum variance distortionless response (MVDR) beamformer employs the DCD algorithm for multiplication-free finding the antenna array weights. An FPGA implementation of the proposed DCD-MVDR beamformer requires a chip area much smaller and throughput much higher than that achieved with other implementations. The performance of the fixed-point implementation is very close to that of floating-point implementation of the MVDR beamformer using direct matrix inversion. When incorporating the DCD algorithm in recursive least squares (RLS) adaptive filter, a new efficient technique, named as the RLS-DCD algorithm, is derived. The RLS-DCD algorithm expresses the RLS adaptive filtering problem in terms of auxiliary normal equations with respect to increments of the filter weights. The normal equations are approximately solved by using the DCD iterations. The RLS-DCD algorithm is well-suited to hardware implementation and its complexity is as low as O(N2) operations per sample in a general case and O(N) operations per sample for transversal RLS adaptive filters. The performance of the RLS-DCD algorithm, including both fixed-point and floating-point implementations, can be made arbitrarily close to that of the floating-point classical RLS algorithm. Furthermore, a new dynamically regularized RLS-DCD algorithm is also proposed to reduce the complexity of the regularized RLS problem from O(N^3) to O(N^2) in a general case and to O(N) for transversal adaptive filters. This dynamically regularized RLS-DCD algorithm is simple for finite precision implementation and requires small chip resources.

DCD Algorithm : Architectures, FPGA Implementations and Applications

A Novel 3780-Point FFT Processor Scheme for the Time Domain Synchronous OFDM System

Joint User Scheduling and Resource Allocation for Millimeter Wave Systems Relying on Adaptive-Resolution ADCs

Robust DCD-Based Recursive Adaptive Algorithms

An Efficient Architecture for the Modified DLMS Algorithm Using CIC Filters

A High Efficiency DDC Algorithm for Narrow Band Signal

Low Computational Complexity RLS-Based Decision-Feedback Equalization in Underwater Acoustic Communications

A Novel Fully Hardware-Implemented SVD Solver Based on Ultra-Parallel BCV Jacobi Algorithm

An Enhanced Adaptive Recoding Rotation CORDIC.

FPGA implementation of high-performance, resource-efficient Radix-16 CORDIC rotator based FFT algorithm

An Area Optimized Direct Digital Frequency Synthesizer Based on Improved Hybrid CORDIC Algorithm

A DDC algorithm for micro/nano satellite communication system

Optimal Design of RDARS-aided Multi-user Systems with Low-resolution DACs

Hardware-Efficient Realization of Prime-Length DCT Based on Distributed Arithmetic

Implementation of Real-Time LCMV Adaptive Digital Beamforming Technology

Field Programmable Gate Array (FPGA) Implementation of Parallel Jacobi for Eigen-Decomposition in Direction of Arrival (DOA) Estimation Algorithm

An Algorithm for Computing DCT Using Improved Arithmetic Fourier Transform

Implementation Method of CORDIC Algorithm to Improve DDFS Performance

A Multiplier Structure Based on A Novel Real-Time Csd Recoding

Dimension Reduction Linear Constrained Minimum Variance Adaptive Digital Beamforming

FPGA Implementation of an Efficient Adaptive Predistortion Algorithm