WOB: novel approach to set active files checkpoint

Dan Pei,Dongsheng Wang,MeiMing Shen,Weimin Zhang
2000-01-01
Tien Tzu Hsueh Pao/Acta Electronica Sinica
Abstract:Checkpoint setting and rollback recovery of Unix process is the underlying technique of fault tolerance of distributed system and parallel environment. To save and restore the state of active file of the process is an important aspect of checkpoint setting and rollback recovery. This paper presents an approach called Write Operation Buffering (WOB) to support this capability. WOB approach buffers all the write operations after a check point until the next one, making all the operation between two checkpoints as a whole. By choosing a suitable size dynamically for memory buffer, and by hiding the latency of flushing the buffer, the WOB approach achieves an overhead lower than other approaches. approaches
What problem does this paper attempt to address?