Low-Power Loop Parallelization Onto CGRA Utilizing Variable Dual VDD

Bing Xu, Shouyi Yin, Leibo Liu, Shaojun Wei
DOI: https://doi.org/10.1587/transinf.2014rcp0004
2015-01-01
IEICE Transactions on Information and Systems
Abstract: Coarse-Grained Reconfigurable Architectures (CGRAs) are a promising platform because of their high performance and low cost. Researchers have developed efficient compilers that map compute-intensive applications onto CGRAs using modulo scheduling. To generate the loop kernel, every stage of the kernel is forced to have the same execution time, which is determined by the critical processing element (PE). Non-critical PEs can therefore lower their supply voltage according to their slack time. The variable Dual-VDD CGRA incorporates this feature to reduce power consumption. Previous work mainly focuses on calculating a single global optimal VDDL with an overall optimization method, which does not fully exploit the flexibility of the architecture. In this brief, instead of a fixed, simulated global optimal VDDL, we adopt a variable optimal VDDL for each stage of the kernel, chosen according to that stage's pattern. Experiments show that our proposed heuristic approach reduces power by 27.6% on average without decreasing performance. The compilation time is also acceptable.
Key words: loop mapping, software pipelining, Dual-VDD, low power, Graph Minor
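The slack argument in the abstract can be illustrated with a small sketch. It is not the paper's heuristic. It only shows, under assumed models, why a per-stage VDDL can beat a single global VDDL. The assumptions are an alpha-power-law delay model, dynamic power proportional to the square of the supply voltage, and no level-shifter or leakage cost. The constants (VDDH, VTH, ALPHA), the candidate voltages, the PE delays, and all function names are hypothetical.

```python
from typing import Sequence

# All constants below are illustrative assumptions, not values from the paper.
VDDH = 1.0    # assumed high (nominal) supply voltage, volts
VTH = 0.3     # assumed threshold voltage, volts
ALPHA = 1.3   # assumed alpha-power-law exponent


def delay_scale(v: float) -> float:
    """Delay at supply v relative to the delay at VDDH (alpha-power law)."""
    return (v / (v - VTH) ** ALPHA) / (VDDH / (VDDH - VTH) ** ALPHA)


def stage_power(delays: Sequence[float], vddl: float) -> float:
    """Relative dynamic power of one kernel stage, modeled as the sum of V^2 over PEs.

    `delays` holds each PE's delay at VDDH. The slowest (critical) PE sets
    the stage period. A PE runs at vddl only if its stretched delay still
    fits in that period; otherwise it stays at VDDH.
    """
    period = max(delays)
    scale = delay_scale(vddl)
    return sum(vddl ** 2 if d * scale <= period else VDDH ** 2 for d in delays)


def best_vddl(delays: Sequence[float], candidates: Sequence[float]) -> float:
    """Pick the candidate VDDL that minimizes this stage's modeled power."""
    return min(candidates, key=lambda v: stage_power(delays, v))


def total_power(stages: Sequence[Sequence[float]], vddls: Sequence[float]) -> float:
    """Total modeled power when stage i uses vddls[i]."""
    return sum(stage_power(d, v) for d, v in zip(stages, vddls))


if __name__ == "__main__":
    candidates = [0.6, 0.7, 0.8, 0.9]
    # Two hypothetical stages: little slack in the first, a lot in the second.
    stages = [[10.0, 8.5, 8.5], [10.0, 5.0, 5.0]]

    per_stage = [best_vddl(d, candidates) for d in stages]
    global_v = min(candidates, key=lambda v: total_power(stages, [v] * len(stages)))

    print("per-stage VDDL:", per_stage, "power:", total_power(stages, per_stage))
    print("global VDDL:", global_v, "power:", total_power(stages, [global_v] * len(stages)))
```

In this toy setup the stage with little slack can only tolerate a VDDL close to VDDH, while the stage with more slack can go much lower. A single global VDDL has to compromise between the two, which is the inefficiency the abstract attributes to the fixed global value.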