Tetris-SDK: Efficient Convolution Layer Mapping with Adaptive Windows for Fast in Memory Computing

Ke Dong,Kejie Huang,Bo Wang
DOI: https://doi.org/10.1109/iscas58744.2024.10558042
2024-01-01
Abstract:Shifted-and-Duplicated-Kernel (SDK) mapping has emerged as a promising technique for accelerating convolutional layers in Compute-In-Memory (CIM) architectures. While state-of-the-art SDK variants have achieved decent mapping efficiency, optimizations are still desired to enhance CIM utilization and reduce computing cycles. In this work, we propose Tetris-SDK, a novel tool that exploits adaptive windows to further improve the performance of convolution layer mapping. These windows can accommodate a larger number of input channels, increase array utilization at marginal space, and adjust window shapes to minimize compute latency. Our experiments with a 512x512 CIM array demonstrate that Tetris-SDK remarkably accelerates CNN layers up to 78.4x, 8x, and 1.3x compared to the baseline mapping algorithms, i.e., img2col, SDK, and VW-SDK, respectively. This shows that Tetris-SDK is a promising design automation solution to map Convolutional Neural Networks in CIM hardware.
What problem does this paper attempt to address?