From Circuits to SoC Processors: Arithmetic Approximation Techniques & Embedded Computing Methodologies for DSP Acceleration

Vasileios Leon
2023-12-22
Abstract:The computing industry is forced to find alternative design approaches and computing platforms to sustain increased power efficiency, while providing sufficient performance. Among the examined solutions, Approximate Computing, Hardware Acceleration, and Heterogeneous Computing have gained great momentum. In this Dissertation, we introduce design solutions and methodologies, built on top of the preceding computing paradigms, for the development of energy-efficient DSP and AI accelerators. In particular, we adopt the promising paradigm of Approximate Computing and apply new approximation techniques in the design of arithmetic circuits. The proposed arithmetic approximation techniques involve bit-level optimizations, inexact operand encodings, and skipping of computations, while they are applied in both fixed- and floating-point arithmetic. We also conduct an extensive exploration on combinations among the approximation techniques and propose a low-overhead scheme for seamlessly adjusting the approximation degree of our circuits at runtime. Based on our methodology, these arithmetic approximation techniques are then combined with hardware design techniques to implement approximate ASIC- and FPGA-based DSP and AI accelerators. Moreover, we propose methodologies for the efficient mapping of DSP/AI kernels on distinctive embedded devices, i.e., the space-grade FPGAs and the heterogeneous VPUs. On the one hand, we cope with the decreased flexibility of space-grade technology and the technical challenges that arise in new FPGA tools. On the other hand, we unlock the full potential of heterogeneity by exploiting all the diverse processors and memories. Based on our methodology, we efficiently map computer vision algorithms onto the radiation-hardened NanoXplore's FPGAs and accelerate DSP & CNN kernels on Intel's Myriad VPUs.
Hardware Architecture
What problem does this paper attempt to address?
The problem that this paper attempts to solve is how to improve the energy - efficiency ratio of computing systems through approximate computing, hardware acceleration, and heterogeneous computing methods in the fields of digital signal processing (DSP) and artificial intelligence (AI). Specifically, the paper focuses on the following aspects: 1. **Approximate Computing**: Utilize the inherent tolerance of DSP and AI applications to errors to achieve gains in power consumption, area, and/or performance by reducing the quality of results. The paper proposes new approximate techniques, such as bit - level optimization, imprecise operand encoding, and computation skipping, and applies these techniques in fixed - point and floating - point arithmetic. 2. **Hardware Acceleration**: Transfer high - demand computing tasks from general - purpose processors to dedicated hardware for execution, such as application - specific integrated circuits (ASICs) and field - programmable gate arrays (FPGAs), to improve efficiency. 3. **Heterogeneous Computing**: Design flexible processing architectures, such as vision processing units (VPUs), which integrate multiple types of processors and different memory technologies to adapt to diverse computing requirements. The main contribution of the paper lies in developing a series of design solutions and methodologies for energy - efficient DSP and AI accelerators, especially by combining the above - mentioned computing paradigms to achieve this goal. In addition, the paper also explores how to efficiently map DSP/AI kernels onto new spatial - level FPGAs and heterogeneous VPUs to overcome the technical challenges of these devices and fully utilize their hardware complexity and diversity. Through these methods, the paper aims to provide an effective way to balance resource utilization and performance improvement, especially in resource - constrained environments, such as space applications.