Error Recovery in a Real-Time Multiprocessor System

Weihua Li,Youguang Yuan
DOI: https://doi.org/10.1007/bf02946170
1992-01-01
Abstract:In this paper, a new scheme for recovering errors due to transient faults in a real-time multiprocessor system is presented. The scheme, called dynamic redundancy at the task level, is implemented in a real-time multitasking environment. Utilizing the facilities in the operating system, the scheme makes backup tasks for the primary tasks as redundancy. The paper introduces an algorithm to generate a fault tolerant schedule for the tasks so that they recover errors as retry or checkpointing does. A reliability model is proposed to evahuste the effectiveness of the scheme.
What problem does this paper attempt to address?