HCRF: A Hardware Checkpoint-based Recovery Framework in Light Dual-Core Lockstep Processors

Jingzhou Li,Huaiyu Chen,Wenbin Zhang,Hu He
DOI: https://doi.org/10.1145/3649476.3658781
2024-01-01
Abstract:Lockstep processors have become enormously popular in the global market, as automotive vehicles show significant demands for microprocessors with high reliability and safety. Vendors usually develop off-core level lockstep processors which are composed of two identical microprocessors and compare their memory access-related signals to detect transient faults. However, as safety-critical applications require harsher timeliness, the detection and recovery of this solution may not be suitable. Enlightened by the checkpoint technique in out-of-order processors to realize precise exceptions, we propose a Hardware Checkpoint-based Recovery Framework (HCRF) to shorten the detection and recovery time of light, off-core lockstep processors. The recovery time of HCRF is 41.0% faster than the baseline off-core lockstep processors and HCRF could save 39.7% of the total execution energy when transient faults occur.
What problem does this paper attempt to address?