A Power-Efficient Accelerator Based on FPGAs for LSTM Network

Yiwei Zhang, Chao Wang, Lei Gong, Yuntao Lu, Fan Sun, Chongchong Xu, Xi Li, Xuehai Zhou
DOI: https://doi.org/10.1109/cluster.2017.45
2017-01-01
Abstract: Today, artificial neural networks (ANNs) are widely used in a variety of applications, including speech recognition, face detection, and disease diagnosis. Long Short-Term Memory (LSTM), an emerging class of ANN, is a recurrent neural network (RNN) with complex computational logic. To achieve high accuracy, researchers often build large-scale LSTM networks, which are costly in both time and power. In this paper, we present a hardware accelerator for the LSTM neural network layer, implemented on the Zedboard FPGA platform, and use pipelining to parallelize the forward computation. We also implement a sparse LSTM hidden layer, which consumes fewer storage resources than the dense network. Our accelerator is power-efficient and runs faster than an ARM Cortex-A9 processor.