ImageMap: Enabling Efficient Mapping from Image Processing DSL to CGRA.

Bizhao Shi,Tuo Dai,Sunan Zou,Xinming Wei,Guojie Luo
DOI: https://doi.org/10.1007/978-3-031-69577-3_5
2024-01-01
Abstract:Image processing has broad application scenarios and fast-developing algorithms, requiring hardware platforms with both high energy efficiency and flexible programmability. Coarse-grained Reconfigurable Arrays (CGRAs) show great potential with their regular parallel architectures and word-level spatio-temporal reconfigurability. However, the mapping of image processing applications on CGRAs faces two main challenges: 1) low-level CGRA programming with multiple constraints brings difficulties for developers; 2) the gaps between coarse-grained image pipelines and the fine-grained pipelined loops executed on CGRA generate a huge program transformation space. In order to tackle these challenges, we propose ImageMap, an efficient mapping framework from Halide, an image processing DSL, to CGRAs. Firstly, we propose multi-level partitioning with the extended Halide scheduling primitives to decompose the complex applications systematically. Secondly, we propose a hierarchical program exploration algorithm specialized for CGRAs with the consideration of partitioning and CGRA performance modeling. Thirdly, we build an automatic compilation framework with several compilation optimization techniques to improve the mapping quality. In the experimental evaluations, ImageMap performs better than existing work across multiple CGRA architectures and common image processing applications.
What problem does this paper attempt to address?