CCP: Configuration Context Based Prefetching to Improve Coarse-Grained Reconfigurable Array Performance.

Chen Yang,Jia Hou,Yizhou Wang,Qi Zhou,Li Geng
DOI: https://doi.org/10.1109/icecs46596.2019.8964688
2019-01-01
Abstract:Coarse-grained reconfigurable arrays (CGRA) with high performance cache prefetching can mitigate off-chip memory latency. Taking advantage of the iteration number of configuration context running on CGRA, this paper proposes a configuration context based prefetching (CCP) mechanism for CGRA. CCP utilizes the first 256 memory accesses to obtain best offset of demand addresses to base address, and then associates it with configuration context to generate prefetching addresses on a cache miss. Moreover, to reduce bandwidth consumption caused by inaccurate prefetching, CCP dynamically adjusts prefetch degree according to configuration context and the current accuracy. Choosing 13 dwarfs as benchmarks, experimental results show that the proposed CCP mechanism can effectively improve CGRA performance, averagely improving by 35.23% compared to LRU (no-prefetching baseline), by 12.98% compared to PACMan and by 10.41% compared to SBP, respectively.
What problem does this paper attempt to address?