PROPERLY GREEDY CACHE PREFETCH INTEGRATED ALGORITHM IN THE PARALLEL FILE SYSTEM

LU Kai,JIN Shi-Yao,LU Xi-Cheng
DOI: https://doi.org/10.3321/j.issn:0254-4164.1999.11.010
1999-01-01
Chinese Journal of Computers
Abstract:Cache and Prefetch is two efficient ways to lower the delay of I/O request. The latency of physical I/O operations can be avoided by caching frequently referenced data in primary memory rather than in secondary memory. And latency tolerance can be achieved by prefetch's overlapping long I/O access with independent operations. But simple cache and prefetch algorithm is no use for parallel scientific application, and prefetching must be carefully balanced against caching. This paper proposes a new prefetch algorithm faced to parallel scientific application: Properly Greedy Cache Prefetch Integrated Algorithm (PGI). PGI is based on the regularity of parallel scientific application's I/O access mode. Now, LRU _SP and TIP algorithm is too greedy when they execute prefetch operation. PGI can overcome the shortcomings of LRU _SP and TIP. PGI uses properly greedy dynamic slide window, which can adjust its prefetch size according to application's consume speed and I/O service delay. When replacing the cache blocks, PGI uses cache and prefetch integrated loss estimate algorithm to achieve the least loss and least I/O service time. In parallel file system environment, the workload of each I/O node is different, PGI thinks of it fully. When replacing cache blocks, PGI will first replace those cache blocks on light workload I/O nodes. So, the workload of each I/O node will be balance, and the total service time will be lower. The result shows that the PGI can bring higher hit ration, shorter I/O operation delay and balance workload.
What problem does this paper attempt to address?