A Fault-Tolerant Single-Chip Multiprocessor

Wenbin Yao,Dongsheng Wang,Weimin Zheng
DOI: https://doi.org/10.1007/978-3-540-30102-8_12
2004-01-01
Abstract:The microprocessor is a crucial component of a reliable system. With improvement in semiconductor manufacturing, more and more transistors may be integrated into a single chip with increased potential detriment to dependability. Fault-tolerant single-chip multiprocessors offer an ideal architecture for achieving high availability while maintaining high performance. The design of a fault-tolerant single-chip multiprocessor is described - from hardware redundancy to software support and firmware information strategies. The design aims at masking the influences of errors and automatically correcting system states, which differs from traditional approaches which mainly target errors in the memory and I/O subsystems. Dynamic recovery and reconfiguration are also described to provide adequate protection from catastrophic failure of the system.
What problem does this paper attempt to address?