DPHM: A Fault Detection Protocol Based on Heartbeat of Multiple Master-nodes

Dong Jian, Zuo Decheng, Liu Hongwei, Yang Xiaozong
DOI: https://doi.org/10.1007/s11767-006-0122-5
2007-01-01
Abstract: Most fault detection algorithms for distributed systems restrict the fault model to process faults; link failures are either simply masked or modeled as process failures. Both approaches can quickly exhaust system resources and potentially reduce system availability. This paper proposes DPHM, a fault Detection Protocol based on the Heartbeat of multiple Master-nodes, which can immediately and accurately detect and locate faulty links by adopting a voting and election mechanism among the master-nodes. DPHM can thus effectively improve system availability. In addition, compared with other detection protocols, DPHM greatly reduces detection cost owing to its master-node structure.
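The abstract gives no protocol details, so the following is only a minimal sketch of the general idea of telling a link fault apart from a node fault by comparing the heartbeat observations of several master-nodes. The `Master` class, the `classify` function, the timeout rule, and the three verdict labels are all assumptions made for illustration; they are not taken from the paper and do not model its election mechanism.

```python
from dataclasses import dataclass, field


@dataclass
class Master:
    """Hypothetical master-node that records when it last heard each node."""
    name: str
    last_seen: dict = field(default_factory=dict)  # node name -> last heartbeat time

    def heartbeat(self, node, now):
        # Record a heartbeat received from `node` at time `now`.
        self.last_seen[node] = now

    def misses(self, node, now, timeout):
        # True if no heartbeat from `node` arrived within `timeout`.
        last = self.last_seen.get(node)
        return last is None or now - last > timeout


def classify(masters, node, now, timeout):
    """Combine the masters' observations (an assumed rule, not the paper's).

    - no master misses the node   -> "ok"
    - every master misses it      -> "node_suspected"
    - only some masters miss it   -> "link_suspected", with the suspect links
    """
    missing = [m.name for m in masters if m.misses(node, now, timeout)]
    if not missing:
        return ("ok", [])
    if len(missing) == len(masters):
        return ("node_suspected", [])
    return ("link_suspected", [(name, node) for name in missing])


# Example: n1 is still heard by m1 and m2 but not by m3; n2 is heard by nobody.
masters = [Master("m1"), Master("m2"), Master("m3")]
masters[0].heartbeat("n1", 10)
masters[1].heartbeat("n1", 10)
masters[2].heartbeat("n1", 2)
result_n1 = classify(masters, "n1", now=11, timeout=5)
result_n2 = classify(masters, "n2", now=11, timeout=5)
```

Under these assumptions, `result_n1` flags only the link between `m3` and `n1`, while `result_n2` suspects the node itself. This is the distinction that a single-observer heartbeat scheme cannot make, because one observer sees a broken link and a crashed node identically.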