Design Of High Reliable Replication Cache For Data Cache Reliability

Huiqing Luo,Jianyuan Wu,Zhaolin Li
2008-01-01
Abstract:Modem high-performance microprocessors utilize multi-level cache. structures to help tolerate the increasing latency of main memory. This complicated cache structure is more vulnerable to soft error under severe environment. The widely used EDC (Error Detecting Code) and ECC (Error Correcting Code) can effectively tolerate a high variation-induced defect rate. However, ECC have limited capability in error detection and correction due to the restriction of area and latency. This paper proposes a low-delay technique to deal with soft errors with minimum impact on performance. The idea is to integrate a redundant cache (R-cache) line into L1 data cache. The R-cache line stores the replica of each "dirty" write to the corresponding L1 data cache set block. Only EDC is needed to detect errors while the original area cost of ECC can compensate the newly adopted R-cache. Early Write Back is employed to make the whole cache structure more efficient and reliable. The access time overhead is negligible because the process is operated in parallel. The technique is efficient in comparison with other solutions, for no encoding or decoding circuit is needed, while it provides much more powerful protection than single ECC method. Experiment results show that the cache system proposed in this paper can provide replicas for nearly 100% of "dirty" cache lines in L I data cache on average.
What problem does this paper attempt to address?