A Multi-Mode Visual Recognition Hardware Accelerator for AR/MR Glasses

Yunhui Zhu,Yaohua Zuo,Tong Zhou,Guoping Fan
DOI: https://doi.org/10.1109/iscas.2018.8350918
2018-01-01
Abstract:A multi-mode visual recognition hardware accelerator for AR/MR glasses is designed in this paper. The accelerator supports state-of-the-art deep neural networks, including DNN, CNN, RNN and LSTM. To achieve higher utilization rate of computational components, the accelerator supports two mapping modes of neural networks to physical computational structures in a single PE (Processing Engine) array: a) 2D systolic flow of both filter and image data, b) each neural network output maps to a PE. The accelerator adaptively chooses the more efficient mapping mode layer by layer, achieving a higher PE utilization rate than single mapping mode accelerators. When benchmarking with Inception-v4 network, the accelerator's PE utilization rate is 79.5%, which is 20 points higher than state-of-the-art embedded accelerators. Higher PE utilization rate contributes to lower latency, higher throughput, less power consumption and smaller chip area. When benchmarking with AlexNet, the accelerator processes 108 images per second, which fully meets the real-time requirement for AR/MR applications.
What problem does this paper attempt to address?