An Ultra-High Energy-Efficient Reconfigurable Processor for Deep Neural Networks with Binary/Ternary Weights in 28NM CMOS

Shouyi Yin, Peng Ouyang, Jianxun Yang, Tianyi Lu, Xiudong Li, Leibo Liu, Shaojun Wei
DOI: https://doi.org/10.1109/vlsic.2018.8502388
2018-01-01
Abstract: An energy-efficient reconfigurable processor for deep neural networks with binary/ternary weights and 1/2/4/8/16-bit activations is implemented in 28nm technology. Three techniques, Total-Partial-Pixel-Summation (TPPS), Kernel-Transformation-Data-Reconstruction (KTDR), and Hybrid Load-Balancing Mechanism (HLBM), are employed to improve energy efficiency. Measurement results show peak energy efficiencies of 95.8 TOPS/W for BWN, 95.1 TOPS/W for TWN, and 765.6 TOPS/W for BNN, a 6.6x improvement over state-of-the-art works.
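As a rough illustration of why binary/ternary weights lend themselves to energy-efficient hardware: when weights are restricted to {-1, +1} or {-1, 0, +1}, a dot product needs no multiplications, only additions, subtractions, and skips. The sketch below is generic and is not the paper's TPPS, KTDR, or HLBM technique; the function name `ternary_dot` and all sample values are hypothetical.

```python
def ternary_dot(activations, weights):
    """Dot product with weights restricted to -1, 0, or +1.

    Illustrative only: no multiplications are used; each weight
    selects add, subtract, or skip for its activation.
    """
    if len(activations) != len(weights):
        raise ValueError("activations and weights must have the same length")
    total = 0
    for a, w in zip(activations, weights):
        if w == 1:
            total += a
        elif w == -1:
            total -= a
        elif w != 0:
            raise ValueError(f"weight {w!r} is not in {{-1, 0, +1}}")
        # w == 0: the term contributes nothing and is skipped
    return total


acts = [3, 5, 2, 7]          # hypothetical integer activations
binary_w = [1, -1, 1, -1]    # binary weights: +1 / -1
ternary_w = [1, 0, -1, 1]    # ternary weights: +1 / 0 / -1

print(ternary_dot(acts, binary_w))   # computes 3 - 5 + 2 - 7
print(ternary_dot(acts, ternary_w))  # computes 3 - 2 + 7 (the 5 is skipped)
```

Binary weights are simply the special case with no zeros, so one routine covers both weight types in this sketch.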