Fault-Tolerance CMP Architecture Based on SMT Technology

Linzhi Ning,Wenbin Yao,Jun Ni,Nianmin Yao
DOI: https://doi.org/10.1109/imsccs.2007.39
2007-01-01
Abstract:In order to improve the reliability of the single-chip multi-processor (CMP), this paper proposes a fault- tolerant CMP architecture which combines with the simultaneous multi-threading (SMT) technology so as to implement the transient fault detection and to automatically accomplish the thread-level recovery. The architecture, through adopting a simple strategies and a little extra hardware to implement the functionality of fault tolerance, attains a wider coverage of the fault and improves the performance of the fault-tolerant CMP.
What problem does this paper attempt to address?