CINDA: Don't Ignore Instructions when Cloning Memory Access Behavior

Wenhai Lin, Yiquan Chen, Jiexiong Xu, Zhen Jin, Peiyu Liu, Shishun Cai, Yuzhong Zhang, Jingchang Qin, Yiquan Lin, Wenzhi Chen
DOI: https://doi.org/10.1109/ccgrid59990.2024.00063
2024-01-01
Abstract: Existing workload cloning methods suffer from low accuracy because they focus primarily on data access patterns and ignore instruction accesses. This limitation reduces the accuracy of shared L2 cache design exploration and prevents processor designers from optimizing Icache and ITLB designs. In this paper, we propose CINDA, a novel workload cloning technique that can Capture both INstruction and DAta access patterns of applications. In particular, CINDA separates the instruction and data traces of an application, generates a proxy instruction trace and a proxy data trace from them, and then merges the two proxies. The results show that CINDA replicates memory access behavior with 99.1%, 99.9%, and 96.2% accuracy for L1 Icache, ITLB, and L2 cache performance, respectively. Furthermore, CINDA outperforms state-of-the-art methods, reducing the L2 cache miss error by 7.7%.
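The separate, generate, and merge flow described in the abstract can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the `Access` record, the function names, and in particular `make_proxy`, which only rebases addresses to offsets and is a stand-in, not CINDA's proxy generation method. The merge step assumes the original interleaving order is retained as a sequence number.

```python
from dataclasses import dataclass, replace
from heapq import merge
from typing import List, Tuple


@dataclass(frozen=True)
class Access:
    """Hypothetical trace record (not taken from the paper)."""
    seq: int   # position in the original interleaved trace
    kind: str  # "I" for instruction fetch, "D" for data access
    addr: int  # accessed address


def split_trace(trace: List[Access]) -> Tuple[List[Access], List[Access]]:
    """Separate a mixed trace into instruction and data streams."""
    instr = [a for a in trace if a.kind == "I"]
    data = [a for a in trace if a.kind == "D"]
    return instr, data


def make_proxy(stream: List[Access]) -> List[Access]:
    """Placeholder proxy generator (an assumption, NOT CINDA's algorithm).

    Rebases every address to an offset from the stream's lowest address,
    which hides the original addresses but keeps strides and reuse intact.
    """
    if not stream:
        return []
    base = min(a.addr for a in stream)
    return [replace(a, addr=a.addr - base) for a in stream]


def merge_streams(instr: List[Access], data: List[Access]) -> List[Access]:
    """Interleave two streams by original sequence number.

    Both inputs must already be sorted by `seq`, which holds for the
    outputs of split_trace and make_proxy.
    """
    return list(merge(instr, data, key=lambda a: a.seq))


if __name__ == "__main__":
    trace = [
        Access(0, "I", 0x1000),
        Access(1, "D", 0x8000),
        Access(2, "I", 0x1004),
        Access(3, "D", 0x8040),
    ]
    instr, data = split_trace(trace)
    for access in merge_streams(make_proxy(instr), make_proxy(data)):
        print(access.seq, access.kind, hex(access.addr))
```

Keeping the two streams separate until the final merge means each proxy can be modeled on its own statistics, while the merged trace still exercises shared structures such as the L2 cache with both kinds of accesses.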