Angel-Eye: A Complete Design Flow for Mapping CNN Onto Customized Hardware.

Kaiyuan Guo,Lingzhi Sui,Jiantao Qiu,Song Yao,Song Han,Yu Wang,Huazhong Yang
DOI: https://doi.org/10.1109/isvlsi.2016.129
2016-01-01
Abstract:Convolutional Neural Network (CNN) has become a successful algorithm in the region of artificial intelligence and a strong candidate for many applications. However, for embedded platforms, CNN-based solutions are still too complex to be applied if only CPU is utilized for computation. Various dedicated hardware designs on FPGA and ASIC have been carried out to accelerate CNN, while few of them explore the whole design flow for both fast deployment and high power efficiency. In this paper, we propose Angel-Eye, a programmable and flexible CNN processor architecture, together with compilation tool and runtime environment. Evaluated on Zynq XC7Z045 platform, Angel-Eye is 8× faster and 7× better in power efficiency than peer FPGA implementation on the same platform. A demo of face detection on XC7Z020 is also 20× and 15× more energy efficient than counterparts on mobile CPU and mobile GPU respectively.
What problem does this paper attempt to address?