A Configurable FPGA Accelerator of Bi-LSTM Inference with Structured Sparsity

Shouliang Guo, Chao Fang, Jun Lin, Zhongfeng Wang
DOI: https://doi.org/10.1109/SOCC49529.2020.9524784
2020-01-01
Abstract: To deploy Bi-directional Long Short-Term Memory (Bi-LSTM) on resource-constrained embedded devices, this work presents a configurable FPGA-based Bi-LSTM accelerator that supports structured compression. First, a dense Bi-LSTM model is thoroughly slimmed by a hybrid quantization scheme and structured top-k pruning. Second, the energy consumed by external memory accesses is significantly reduced by...
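The abstract does not define structured top-k pruning in detail, so the following is only a minimal sketch of one common reading of the term: each weight row is split into fixed-size groups, and within every group only the k largest-magnitude weights are kept. The function names, the `group_size` and `k` parameters, and the example weights are all hypothetical and are not taken from the paper.

```python
def topk_prune_row(row, group_size, k):
    """Keep the k largest-magnitude weights in each group; zero the rest.

    Assumed grouping: consecutive, non-overlapping groups of `group_size`
    weights along a row. The paper may group weights differently.
    """
    if len(row) % group_size != 0:
        raise ValueError("row length must be a multiple of group_size")
    pruned = []
    for start in range(0, len(row), group_size):
        group = row[start:start + group_size]
        # Rank positions by descending |w|; sorted() is stable, so ties
        # are resolved in favor of the lower index.
        order = sorted(range(group_size), key=lambda i: -abs(group[i]))
        keep = set(order[:k])
        pruned.extend(w if i in keep else 0.0 for i, w in enumerate(group))
    return pruned


def topk_prune(matrix, group_size, k):
    """Apply the per-group top-k rule to every row of a weight matrix."""
    return [topk_prune_row(row, group_size, k) for row in matrix]


# Hypothetical 2x8 weight matrix, pruned to 2 survivors per group of 4.
weights = [
    [0.9, -0.1, 0.05, -0.7, 0.2, 0.3, -0.8, 0.01],
    [0.0, 0.4, -0.5, 0.1, 0.6, -0.6, 0.2, 0.1],
]
pruned = topk_prune(weights, group_size=4, k=2)
for row in pruned:
    print(row)
```

Under this assumed scheme every group ends up with at most k nonzero weights, so the sparsity pattern is regular and each group needs the same amount of storage and compute, which is the kind of property that generally makes structured sparsity easier to exploit in hardware than unstructured pruning.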