Embrace the Conflicts: Exploring the Integration of Single Port Memory in Systolic Array-based Accelerators.

Renyu Yang,Junzhong Shen,Mei Wen,Yasong Cao,Yuhang Li
DOI: https://doi.org/10.1109/hpcc-dss-smartcity-dependsys53884.2021.00044
2021-01-01
Abstract:On-chip memory is one of the core components of deep learning accelerators. In general, the area overhead of the on-chip memory accounts for over 25 % of the total chip area. With the increasing complexity of deep learning algorithms, it will become a challenge for the accelerators to integrate much larger on-chip memory responding to algorithm needs. To solve it, this paper explores the use of Single Port memory(SPM) in systolic array based deep learning accelerators. We propose an efficient address transformation method to avoid the conflict of simultaneous read and write requests on the SPM. In addition, we prove that this method is feasible and can be implemented on hardware without affecting the computation efficiency of the accelerator. Experimental results show that it has about 30 % improvement in area cost when accelerator integrates SPM without affecting the throughput of the accelerator.
What problem does this paper attempt to address?