An approach of processor core customization for stencil computation

Yanhua Li,Youhui Zhang,Jianfeng Yang,Wayne Luk,Guangwen Yang,Weimin Zheng
DOI: https://doi.org/10.1109/ASAP.2014.6868656
2014-01-01
Abstract:Architecture customization is believed as one of the most promising methods to meet ever-increasing computing needs and power density limitations. This paper presents an approach to enhance a preliminary customizable core with some common architecture features, to adapt to the specific applications while keeping the programming flexibility. Those features include several effective software/hardware co-optimizing strategies, such as loop tiling, pre-fetching, cache customization, customized Single Instruction Multiple Data (SIMD) and Direct Memory Access (DMA), as well as the necessary ISA extensions. Currently we select stencil computation as the research target. Detailed tests of power-efficiency to evaluate the effect of all these optimizations comprehensively shows impressive performance speedup and power efficiency, even compared to X86, GPU and FPGA platforms. All these proposed customizations here could be applied to other computing applications.
What problem does this paper attempt to address?