ABMLP: Attention-Based Multi-Layer Perceptron Prefetcher

Juan Fang,Xin Lv,Huayi Cai
DOI: https://doi.org/10.1145/3577530.3577579
2022-01-01
Abstract:Cache prefetching is a traditional way to reduce memory access latency. Machine learning algorithms have shown the potential to accurately predict the future addresses of memory accesses. The existing approach is rule-based prefetchers. Rule-based prefetchers are effective at predicting common access patterns, but they are inefficient in complex cross-page access patterns. Therefore, we propose ABMLP prefetcher, which uses the attention method to learn the mapping relationship between network input and output, and assigns the corresponding weight. We used two prefetchers, MLP prefetcher and BO prefetcher, to cope with different access patterns. Overall, ABMLP prefetcher improved 19.85% overall coverage accuracy and 18.5% IPC improvement over baseline.
What problem does this paper attempt to address?