Virtual Container Based Consistent Cluster Checkpoint

Xiao-jia Xiang,Hong-liang Yu,Ji-wu Shu
DOI: https://doi.org/10.1007/978-3-642-10741-2_9
2009-01-01
Abstract:Checkpoint can store and recovery applications when faults happen and is becoming critical to large information systems Unfortunately. existing checkpoint tools have some limitations such as not transparent to applications, ignoring file system states, cluster checkpoint is not well supported. and so on We present a light weight virtual container based cluster checkpoint Firstly, a virtual container, IPG (Isolated Process Group). is designed to wrap all target applications together and produce checkpoint transparently and completely Secondly. each IPG has its independent namespace built on an exclusively owned LV (Logical Volume), which can be checkpointed synchronously with the IPG's memory to guarantee the consistency. Finally. distributed applications can be deployed on many IPGs and a cluster checkpoint protocol is presented to orchestrate all IPGs to produce global checkpoints Experiments and evaluations results illustrate that no overhead will be introduced for applications running in IPGs, and our prototype system works more stable than the library base checkpoint tools
What problem does this paper attempt to address?