Fault Tolerant Framework in MPI-based Distributed DEVS Simulation

Bin Chen, Xiao-gang Qiu, Ke-di Huang
2009-01-01
Abstract: Distributed DEVS simulation plays an important role in solving complex problems because its component models are reusable and composable. With MPI as the communication middleware, distributing the simulation improves performance. However, even a minor fault in a computing resource can crash the whole simulation, so fault tolerance is necessary to keep the simulation reliable. This paper introduces a DEVS framework that supports fault tolerance. Optimistic distributed simulators implement the distribution of the DEVS simulation. Fault detection, state storage, and fault recovery are integrated into the framework to avoid crashes at runtime. Experiments are carried out to find the optimal timeout for the fault-tolerant framework. The results indicate that the framework has to be adjusted as the simulation requirements change.
Keywords: DEVS, Fault Tolerance, Fault Detection, States Storage, Fault Recovery
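The interplay of the three mechanisms named in the abstract (timeout-based fault detection, state storage, and fault recovery) can be illustrated with a minimal sketch. This is not the paper's implementation: it uses no MPI, the names `HeartbeatMonitor` and `CheckpointStore` are hypothetical, and timestamps are passed in explicitly so that the example is deterministic.

```python
from copy import deepcopy


class HeartbeatMonitor:
    """Hypothetical detector: a simulator is suspected failed when its
    last heartbeat is older than `timeout`."""

    def __init__(self, timeout):
        self.timeout = timeout
        self.last_seen = {}

    def beat(self, rank, now):
        # Record the time at which `rank` last reported in.
        self.last_seen[rank] = now

    def failed(self, now):
        # Ranks whose silence exceeds the timeout, in ascending order.
        return sorted(r for r, t in self.last_seen.items()
                      if now - t > self.timeout)


class CheckpointStore:
    """Hypothetical state storage: keeps the latest saved state per rank."""

    def __init__(self):
        self._saved = {}

    def save(self, rank, sim_time, state):
        # Deep-copy so later changes to `state` do not alter the checkpoint.
        self._saved[rank] = (sim_time, deepcopy(state))

    def restore(self, rank):
        sim_time, state = self._saved[rank]
        return sim_time, deepcopy(state)


monitor = HeartbeatMonitor(timeout=3.0)
store = CheckpointStore()

state = {"queue_length": 2}
store.save(rank=1, sim_time=10.0, state=state)
monitor.beat(0, now=0.0)
monitor.beat(1, now=0.0)

state["queue_length"] = 7   # progress after the checkpoint, lost on failure
monitor.beat(0, now=4.0)    # rank 1 stays silent

failed = monitor.failed(now=5.0)
recovered = {r: store.restore(r) for r in failed}
print(failed, recovered)
```

In a sketch like this, the timeout is the main tuning knob: a shorter value detects a silent simulator sooner but is more likely to flag one that is merely slow, which is one plausible reason the optimal value would depend on the simulation's requirements.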