An Enhanced Data Cache with In-Cache Processing Units for Convolutional Neural Network Accelerators

Yu-Chao Zhou,Mai Lei,Yong-Liang Zhang,Quan Zhang,Jun Han
DOI: https://doi.org/10.1109/ICSICT49897.2020.9278154
2020-01-01
Abstract:Convolutional neural network algorithms rely on large quantities of data to extract hidden information. In order to achieve good performance, a CNN accelerator should be able to easily access a wide range of memory space. However, on-chip memory has limited capacity, and needs to be manually managed by the accelerator controller. Besides, accessing off-chip memory like a DRAM array needs a physical address, which is not calculated until a memory access request arrives at a memory access control unit. Managing those issues manually complicates CNN accelerator design. Therefore, we propose a data cache enhanced with in-cache processing units, which processes segment address in parallel with data, and exploits data locality in an automatic manner. Overheads such as physical address translation, quantization, and ReLU, are offloaded from the accelerator controller.
What problem does this paper attempt to address?