<sup>2</sup>b-sigmoid and <sup>2</sup>b-tanh: Low Hardware Complexity Activation Functions for LSTM

Yuan Zhang,Lele Peng,Lianghua Quan,Shubin Zheng,Qiufeng Feng,Yonggang Zhang,Hui Chen
DOI: https://doi.org/10.1109/ISOCC56007.2022.10031500
2022-01-01
Abstract:Sigmoid and tanh functions are common activation functions in long short-term memory (LSTM) network. However, the natural exponential operations in their formulas are hardware unfriendly. In order to reduce the complexity of their hardware implementation, we propose two new sigmoid and tanh functions called <tex xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">$2b$</tex> -sigmoid and <sup xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">2</sup> b-tanh. Through experiments, we prove that they can still be applied to LSTM. Then, we design an efficient hardware architecture to implement them. Through comparative analysis, we find that the hardware cost of the proposed functions is lower than the traditional sigmoid and tanh functions.
What problem does this paper attempt to address?