Reliability Analysis of Ensemble Fault Tolerance for Soft Error Mitigation Against Complex Radiation Effect.

Shunkun Yang,Qi Shao,Chong Bian
DOI: https://doi.org/10.1016/j.ress.2021.108092
IF: 7.247
2021-01-01
Reliability Engineering & System Safety
Abstract:With the progressive miniaturization of integrated circuits and the increasing complexity of logic functions, the possibility of software system failure triggered by soft errors in the context of intensive space radiation is also increasing. The quantitative analysis of the reliability of different mitigation strategies against superposition affections in the context of a dynamic space radiation environment is still challenging. To solve these problems, a mixed extreme run degraded shock model is proposed to comprehensively describe the superposition of the space radiation effect, and a degradation mechanism is introduced to address the cumulative effect. Different ensemble mechanisms of structure fault tolerance, information fault tolerance, and rejuvenation strategy are modeled for quantitative analysis. Subsequently, through parameter transfer and state synchronization control technology, an interaction mechanism is established between the environment and design mode models to realize the dynamic reliability evolution analysis via probabilistic model checking. Experimental results demonstrate that the ensemble of structural fault-tolerance and information fault-tolerance has the best resistance to the space radiation effect with the reliability improvement by 6.17-20.17% under the given experimental conditions.
What problem does this paper attempt to address?