Reconfigurable Architecture for Neural Approximation in Multimedia Computing.

Fengbin Tu,Shouyi Yin,Peng Ouyang,Leibo Liu,Shaojun Wei
DOI: https://doi.org/10.1109/tcsvt.2018.2812781
IF: 5.859
2018-01-01
IEEE Transactions on Circuits and Systems for Video Technology
Abstract:Due to inherent error resiliency, many high performance multimedia applications can be approximated by multi-layer perceptrons (MLPs), with little quality loss. An MLP accelerator can be designed to improve the power efficiency of multimedia systems. However, previous MLP accelerators’ fixed computational pattern lowers the performance when the MLP topology varies for different applications. In this paper, we propose a scheduling framework to guide mapping MLPs onto limited hardware resources. The scheduling framework adjusts the computational patterns for various MLP topologies, obtaining 30% higher performance than the conventional scheduling. We implement a reconfigurable neural architecture (RNA) to support different patterns in the framework and further improve the performance and efficiency. RNA achieves a speedup of $572\times $ on the approximable part, whole application speedup of $7.9\times $ and energy savings of $6.3\times $ , with little quality loss on the benchmarks.
What problem does this paper attempt to address?