Frequency Improvement of Systolic Array-Based CNNs on FPGAs

Jiaxi Zhang,Wentai Zhang,Guojie Luo,Xuechao Wei,Yun Liang,Jason Cong
DOI: https://doi.org/10.1109/iscas.2019.8702071
2019-01-01
Abstract:FPGAs are commercially available off-the-shelf for implementing convolutional neural network (CNN) accelerators to trade off accuracy, performance, and power. Systolic array architecture for CNN accelerators on FPGAs has the potential to run at a high frequency due to its regular and simple interconnections. However, current FPGA CAD tools are unable to synthesize and layout systolic arrays in high quality. In this paper, we identify the reasons for the frequency degradation of systolic array designs for CNN accelerators. We also propose two methods to improve the frequency at the front-end and the back-end, respectively. The experimental results show that our methods are able to achieve 1.29 × higher frequency and attain 1.5TOPS for the VGG16 network on the Xilinx KCU1500 platform.
What problem does this paper attempt to address?