Optimization to Prevent Cache Penalty by Loop Partition and Loop Unrolling

刘利,陈彧,乔林,汤志忠
DOI: https://doi.org/10.3724/sp.j.1001.2008.02228
2008-01-01
Journal of Software
Abstract: Due to the widening speed gap between memory systems and processors, cache hierarchies have been introduced into memory systems, but they bring additional latency (cache penalty). This paper presents an algorithm named Prevent Cache Penalty by Loop Partition-Unrolling (PCPLPU), which prevents cache penalty in loops by combining loop partition and loop unrolling. Experimental results show that PCPLPU can prevent cache penalty and improve program performance.