Abstract: Approximate computing is a promising design paradigm that introduces a new dimension, error, into the original design space. By allowing inexact computation in error-tolerant applications, approximate computing can improve both performance and energy efficiency. A neural network (NN) is a universal approximator in theory and possesses a high level of parallelism. Emerging deep neural network accelerators deployed with an NN-based approximator are thereby promising candidates for approximate computing. Nevertheless, the approximation result must satisfy the users' requirement, and the approximation result varies across different applications. An NN-based classifier is normally deployed to ensure the approximation quality: only the inputs predicted to meet the quality requirement are executed by the approximator. The potential of these two NNs, however, is not fully explored; the involvement of two NNs in approximate computing raises critical optimization questions, such as the two NNs' distinct views of the input data space, how to train the two correlated NNs, and what their topologies should be. In this article, we propose a novel NN-based approximate computing framework with quality assurance. We advocate a co-training approach that trains the classifier and the approximator alternately to maximize the agreement of the two NNs on the input space. In each iteration, we coordinate the training of the two NNs with a judicious selection of training data. Next, we explore different selection policies and propose to select training data from multiple iterations, which can increase the invocation of the approximate accelerator. In addition, we optimize the classifier by integrating a dynamic threshold tuning algorithm to further improve the invocation of the approximate accelerator. The increased invocation of the accelerator leads to higher energy efficiency under the same quality requirement.
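The classifier-gated invocation and threshold tuning described above can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration, not the article's method: `exact` stands for the precise kernel, `approximator` and `classifier_score` are simple closed-form stand-ins for the two NNs, and `tune_threshold` uses one plausible criterion (the lowest threshold at which every invoked sample stays within an error bound).

```python
import math

def exact(x):
    # Precise kernel that the accelerator would otherwise bypass.
    return math.sin(x)

def approximator(x):
    # Hypothetical stand-in for the NN approximator: cubic Taylor polynomial.
    return x - x ** 3 / 6.0

def classifier_score(x):
    # Hypothetical stand-in for the NN classifier: confidence in [0, 1]
    # that the approximator is safe to use for this input.
    return 1.0 / (1.0 + abs(x) ** 3)

def run(x, threshold):
    """Invoke the approximator only when the classifier is confident enough."""
    if classifier_score(x) >= threshold:
        return approximator(x), True   # approximate path
    return exact(x), False             # fall back to exact computation

def tune_threshold(samples, error_bound, candidates):
    """Return the lowest candidate threshold (i.e. the highest invocation)
    for which every invoked sample stays within error_bound, or None."""
    for t in sorted(candidates):
        invoked = [x for x in samples if classifier_score(x) >= t]
        if all(abs(approximator(x) - exact(x)) <= error_bound for x in invoked):
            return t
    return None

samples = [i / 10 for i in range(-30, 31)]
threshold = tune_threshold(samples, 0.01, [i / 20 for i in range(1, 20)])
invocation = sum(run(x, threshold)[1] for x in samples) / len(samples)
```

Lowering the threshold raises the invocation rate but admits inputs with larger approximation error, which is the trade-off a threshold tuning step has to navigate.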
We propose two efficient algorithms to explore the smallest topology of the NN-based approximator and the classifier that achieves the quality requirement. The first algorithm straightforwardly searches for the minimum topology using a greedy strategy, but it incurs too much training overhead. To solve this issue, the second algorithm gradually grows the topology of the NNs to match the quality requirement by transferring the learned parameters. Experimental results show significant improvement in quality and energy efficiency compared to existing NN-based approximate computing frameworks.
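The second algorithm's idea of growing a topology while reusing learned parameters can be sketched as follows. This is a toy illustration under stated assumptions, not the article's algorithm: the network is a one-input tanh network written in plain Python, the abstract does not say how parameters are transferred, and giving the new hidden unit a zero output weight (so the grown network starts from exactly the same function) is an assumption. All names (`new_net`, `grow`, `grow_until`) are hypothetical.

```python
import math
import random

def predict(net, x):
    return net["b2"] + sum(w2 * math.tanh(w1 * x + b1)
                           for w1, b1, w2 in zip(net["w1"], net["b1"], net["w2"]))

def new_net(hidden, rng):
    return {"w1": [rng.uniform(-1, 1) for _ in range(hidden)],
            "b1": [rng.uniform(-1, 1) for _ in range(hidden)],
            "w2": [rng.uniform(-1, 1) for _ in range(hidden)],
            "b2": 0.0}

def grow(net, rng):
    """Add one hidden unit and copy all learned parameters.
    Assumption: the new unit's output weight is 0, so the output is unchanged."""
    return {"w1": net["w1"] + [rng.uniform(-1, 1)],
            "b1": net["b1"] + [rng.uniform(-1, 1)],
            "w2": net["w2"] + [0.0],
            "b2": net["b2"]}

def train(net, data, epochs, lr):
    # Plain stochastic gradient descent on squared error (updates net in place).
    for _ in range(epochs):
        for x, y in data:
            hs = [math.tanh(w1 * x + b1) for w1, b1 in zip(net["w1"], net["b1"])]
            err = net["b2"] + sum(w2 * h for w2, h in zip(net["w2"], hs)) - y
            for i, h in enumerate(hs):
                g_hidden = err * net["w2"][i] * (1 - h * h)
                net["w2"][i] -= lr * err * h
                net["w1"][i] -= lr * g_hidden * x
                net["b1"][i] -= lr * g_hidden
            net["b2"] -= lr * err
    return net

def mse(net, data):
    return sum((predict(net, x) - y) ** 2 for x, y in data) / len(data)

def grow_until(data, target_mse, max_hidden, epochs=200, lr=0.05, seed=0):
    """Start from one hidden unit and grow until the quality target is met
    or max_hidden is reached, continuing training from transferred parameters."""
    rng = random.Random(seed)
    net = train(new_net(1, rng), data, epochs, lr)
    while mse(net, data) > target_mse and len(net["w1"]) < max_hidden:
        net = train(grow(net, rng), data, epochs, lr)
    return net
```

By contrast, a search that trains every candidate topology from scratch repeats the full training cost at each size, which is the overhead the abstract attributes to the first algorithm.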

Energy-Efficient and Quality-Assured Approximate Computing Framework Using a Co-Training Method.