PipeOrgan: Efficient Inter-operation Pipelining with Flexible Spatial Organization and Interconnects

Raveesh Garg, Hyoukjun Kwon, Eric Qin, Yu-Hsin Chen, Tushar Krishna, Liangzhen Lai
2024-05-03
Abstract: Because recent Deep Neural Network (DNN) models are increasingly memory-bound, inter-operator pipelining is emerging as a promising optimization for DNN accelerators. Inter-operator pipelining reduces costly on-chip global memory and off-chip memory accesses by forwarding the output of one layer as the input of the next layer within the compute array, an approach that previous work has shown to be effective.
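The memory-traffic argument in the abstract can be illustrated with a toy model. The sketch below is not the PipeOrgan design; it only counts element accesses to a hypothetical `GlobalBuffer` for two made-up elementwise layers (`layer1`, `layer2`), comparing layer-by-layer execution with a pipelined schedule that forwards each intermediate tile directly to the next layer. All names and the tile size are assumptions chosen for illustration.

```python
class GlobalBuffer:
    """Toy stand-in for on-chip global memory; counts element accesses."""

    def __init__(self):
        self.accesses = 0

    def read(self, data):
        self.accesses += len(data)
        return list(data)

    def write(self, data):
        self.accesses += len(data)
        return list(data)


def layer1(tile):
    # Hypothetical first operator: scale each element.
    return [2 * x for x in tile]


def layer2(tile):
    # Hypothetical second operator: add a bias to each element.
    return [x + 1 for x in tile]


def run_layer_by_layer(inputs, buf):
    # Each layer reads its full input from the buffer and writes its full output back.
    x = buf.read(inputs)
    mid = buf.write(layer1(x))   # intermediate goes to global memory
    mid = buf.read(mid)          # and is read back by the next layer
    return buf.write(layer2(mid))


def run_pipelined(inputs, buf, tile_size=4):
    # Intermediate tiles are forwarded between layers and never touch the buffer.
    out = []
    for i in range(0, len(inputs), tile_size):
        tile = buf.read(inputs[i:i + tile_size])
        mid = layer1(tile)       # stays "within the compute array"
        out.extend(buf.write(layer2(mid)))
    return out


if __name__ == "__main__":
    data = list(range(16))
    unfused, fused = GlobalBuffer(), GlobalBuffer()
    assert run_layer_by_layer(data, unfused) == run_pipelined(data, fused)
    print("layer-by-layer accesses:", unfused.accesses)
    print("pipelined accesses:", fused.accesses)
```

In this toy setting the pipelined schedule removes the write and the read of the intermediate tensor, so buffer traffic drops from four accesses per element to two. Real accelerators must also account for tile shapes, dependencies between layers, and interconnect bandwidth, which this model ignores.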
Hardware Architecture