The EFTOS Voting Farm: A Software Tool for Fault Masking in Message Passing Parallel Environments

Vincenzo De Florio, Greet Deconinck, Rudy Lauwereins
DOI: https://doi.org/10.48550/arXiv.1401.2920
2014-01-14
Abstract: We present a set of C functions implementing a distributed software voting mechanism for EPX or similar message passing environments, and we place it within the EFTOS framework (Embedded Fault-Tolerant Supercomputing, ESPRIT-IV Project 21012) of software tools for enhancing the dependability of a user application. The described mechanism can be used, for instance, to implement restoring organs, i.e., N-modular redundancy systems with N-replicated voters. We show that, besides structural design goals like fault transparency, this tool achieves replication transparency, a high degree of flexibility and ease of use, and good performance.
Distributed, Parallel, and Cluster Computing
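The voting step at the heart of a restoring organ, as mentioned in the abstract, can be illustrated with a minimal sketch. This is not the EFTOS Voting Farm API: the function name `majority_vote`, its signature, and the choice of exact-match majority voting over integer ballots are all assumptions made for illustration, and the message passing between replicated voters is left out entirely.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical helper, NOT part of the EFTOS API.
 * Exact-match majority vote over n integer ballots.
 * Returns 1 and stores the winning value in *result when some value
 * occurs in more than half of the ballots; returns 0 otherwise. */
int majority_vote(const int *ballots, size_t n, int *result)
{
    size_t i, count = 0;
    int candidate = 0;

    assert(ballots != NULL && result != NULL);

    /* Pass 1 (Boyer-Moore majority scan): find the only value that
     * could possibly hold a strict majority. */
    for (i = 0; i < n; i++) {
        if (count == 0) {
            candidate = ballots[i];
            count = 1;
        } else if (ballots[i] == candidate) {
            count++;
        } else {
            count--;
        }
    }

    /* Pass 2: confirm that the candidate really has a strict majority. */
    count = 0;
    for (i = 0; i < n; i++) {
        if (ballots[i] == candidate)
            count++;
    }
    if (2 * count > n) {
        *result = candidate;
        return 1;
    }
    return 0;
}
```

In an N-modular redundancy arrangement with N-replicated voters, each voter would collect one ballot per replica and run a routine of this kind locally, so that a single faulty replica is outvoted. With three ballots 42, 7, 42 the call reports a majority for 42; with three pairwise different ballots it reports that no majority exists.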