Efficient FPGA Implementation of softmax Layer in Deep Neural Network

Quan Zhang,Jian Cao,Shiguang Zhang,Qi Zhang,Ying Zhang,Yuan wang
DOI: https://doi.org/10.1109/ICSIP49896.2020.9339290
2020-01-01
Abstract:Deep neural network (DNN) is an important technology in the field of artificial intelligence. The softmax layer is one of the key component layers for completing multi-classifcation tasks. The softmax layer contains a large number of exponents and division operations, which causes it to consume a lot of hardware resources in FPGA implementation. This paper presents an efficient FPGA implementation of the softmax layer. For the exponential operation of the softmax layer, the mathematical transformation better than the Coordinate Rotation Digital Computer (CORDIC) algorithm is used to convert to subtraction and 2 exponential operation. The division is converted into a subtraction and shift operation, and added the highest non-zero bit detection processing before the operation to greatly reduce the cycle of the division operation. Experimenting with our softmax architecture on Xilinx ZCU102, the results show that the FPGA implementation of softmax layer can attain the precision of magnitude 10-6 and greatly accelerate the calculation of the softmax layer.
What problem does this paper attempt to address?