Abstract:The posit number system aims to be a drop-in replacement of the existing IEEE floating-point standard. Its properties- tapered precision and high dynamic range, allow a smaller size posit to almost match the performance of a much larger size floating-point in representing decimals. This becomes especially useful for performing error-tolerant tasks like deep learning inference computation where low latency and area are a priority. Recent research has found that the performance of deep neural network models saturates beyond a certain level of accuracy of multipliers used for convolutions. Therefore, the extra hardware cost of developing precise arithmetic circuits for such applications becomes an unnecessary overhead. This paper explores approximate posit multipliers in the convolutional layers of deep neural networks and attempts to find an ideal balance between hardware utilization and inference accuracy. Posit multiplication involves several steps, with the mantissa multiplication step utilizing maximum hardware resources. To mitigate this, a posit multiplier circuit using an approximate hybrid-radix Booth encoding for mantissa multiplication and techniques such as truncation and bit masking based on input regime size are proposed. In addition, a novel Booth encoding control scheme to prevent unnecessary bits from switching has been devised to reduce dynamic power dissipation. Compared to existing literature, these optimizations have contributed to a 23% decrease in power dissipation in the mantissa multiplication stage. Further, a novel area and energy-efficient decoder architecture have also been developed with an 11% reduction in dynamic power dissipation and area compared to existing decoders. Overall, the proposed posit multiplier offers a 14% reduction in the PDP over the existing approximate posit multiplier designs. The proposed multiplier also achieves over 90% accuracy in inferencing deep learning models such as ResNet20, VGG-19 and DenseNet.

ADEPNET: A Dynamic-Precision Efficient Posit Multiplier for Neural Networks

A Low-Power In-Memory Multiplication and Accumulation Array with Modified Radix-4 Input and Canonical Signed Digit Weights

Optimally Approximated and Unbiased Floating-Point Multiplier with Runtime Configurability

A Reconfigurable Multiplier for Signed Multiplications with Asymmetric Bit-Widths.

A Reconfigurable Approximate Multiplier for Quantized CNN Applications.

Adaptable Approximate Multiplier Design Based on Input Distribution and Polarity

PAM: A Piecewise-Linearly-Approximated Floating-Point Multiplier with Unbiasedness and Configurability

Multiplier-less Artificial Neurons Exploiting Error Resiliency for Energy-Efficient Neural Computing

Efficient Approximate Floating-Point Multiplier With Runtime Reconfigurable Frequency and Precision

Area-Efficient Iterative Logarithmic Approximate Multipliers for IEEE 754 and Posit Numbers

HEAM: High-Efficiency Approximate Multiplier Optimization for Deep Neural Networks

A fine-grained mixed precision DNN accelerator using a two-stage big-little core RISC-V MCU.

Bit-pragmatic Deep Neural Network Computing

A Hardware- and Accuracy-Efficient Approximate Multiplier with Error Compensation for Neural Network and Image Processing Applications

Low Error-Rate Approximate Multiplier Design for DNNs with Hardware-Driven Co-Optimization

A Logarithmic Floating-Point Multiplier for the Efficient Training of Neural Networks

PositNN: Training Deep Neural Networks with Mixed Low-Precision Posit

Dynamic Precision Multiplier For Deep Neural Network Accelerators

Ax-BxP: Approximate Blocked Computation for Precision-Reconfigurable Deep Neural Network Acceleration

Cheetah: Mixed Low-Precision Hardware & Software Co-Design Framework for DNNs on the Edge

Permutation-Based Approximate Multiplier with High Accuracy.