Index-based quasi-synchronous checkpointing protocol

YuanSheng Luo,Yinghua Min,Dafang Zhang
2005-01-01
Jisuanji Xuebao/Chinese Journal of Computers
Abstract:To provide rollback-recovery for fault-tolerance in distributed systems, it is significant to reduce the number of checkpoints under the existence of consistent global checkpoints in index-based distributed checkpointing algorithms. A new checkpointing protocol is presented in this paper on the basis of index-based checkpointing protocols. It not only reduces the number of forced-checkpoints but also keeps synchronous in time to avoid too much amount of overhead of roll-back recovery due to useful computation losing in case of failure. Simulation results show that the proposal algorithm in this paper can reduce the number of induced forced-checkpoints per message 23.2% on an average comparing to the traditional strategies.
What problem does this paper attempt to address?