Reconfigurable Approximate Multiplication Architecture for CNN-Based Speech Recognition Using Wallace Tree Tensor Multiplier Unit

Junyi Qian,Yuanyuan Jiang,Zilong Zhang,Renyuan Zhang,Ziyu Wang,Bo Liu
DOI: https://doi.org/10.1109/nanoarch53687.2021.9642240
2021-01-01
Abstract:When the neural network technology is applied to the battery-powered terminal equipment, the energy efficiency of its hardware calculation has become the key problem to be considered. Given this, this paper designs and realizes a reconfigurable approximate multiplication architecture for CNN-Based speech recognition. First, a convolutional neural network reconfigurable computing cell structure is presented. Second, it is extended to the design and implementation of a low-power precision controllable convolutional neural network, which includes the Wallace tree tensor multiplier unit and the design of an approximate compressor. As a case study, the proposed approximate designs are applied to a CNN-based keywords speech recognition system. Under TSMC 22nm ULL UHVT process condition, compared with the speech keyword recognition system without approximate computation, the power consumption of the processing engine with approximate multiplication computation unit is reduced by 51.55%, while the recognition accuracy is reduced by only 1%.
What problem does this paper attempt to address?