Subgraph Decoupling and Rescheduling for Increased Utilization in CGRA Architecture.

Chen Yin,Qin Wang,Jianfei Jiang,Weiguang Sheng,Guanghui He,Zhigang Mao,Naifeng Jing
DOI: https://doi.org/10.23919/DATE51398.2021.9474195
2021-01-01
Abstract:When coarse-grained reconfigurable array (CGRA) architecture is shifting towards general-purpose, some complex control flows, such as nested loop, conditional branch and data dependence, may embarrass it and reduce the processing element (PE) array utilization by breaking the intact dataflow graph (DFG) into multiple regions with inconsistent control regions. This paper proposes subgraph decoupling and rescheduling, which decouples the inconsistent regions into control-independent subgraphs. Each subgraph can be rescheduled with zero-cost domino context switching and parallelized to fully utilize the PE resources. Then, we propose lightweight hardware changes based on general CGRA architecture to enable our design. The experiment results show that our proposal can improve the performance and energy efficiency by 1.35x and 1.18x over a static-mapped CGRA (Plasticine), and by 1.27x and 1.45x over an instruction-driven CGRA (TIA).
What problem does this paper attempt to address?