Toward Resilient Algorithms and Applications

Michael A. Heroux
DOI: https://doi.org/10.48550/arXiv.1402.3809
2014-03-14
Abstract:Over the past decade, the high performance computing community has become increasingly concerned that preserving the reliable, digital machine model will become too costly or infeasible. In this paper we discuss four approaches for developing new algorithms that are resilient to hard and soft failures.
Mathematical Software,Distributed, Parallel, and Cluster Computing
What problem does this paper attempt to address?