Investigating Fault Injection Techniques in Hardware-Based Deep Neural Networks and Mutation-Based Fault Localization
Yves Le Traon, Tao Xie
DOI: https://doi.org/10.1002/stvr.1880
2024-01-01
Software Testing Verification and Reliability
Abstract: In this edition, we present two papers that explore complementary aspects of software reliability. Both leverage fault injection techniques: the first sheds light on the impact of hardware faults on deep learning neural networks (DNNs), and the second enhances mutation-based fault localization. The first paper, 'Investigating the impact of transient hardware faults on deep learning neural network inference' by Md Hasanur Rahman, Sabuj Laskar, and Guanpeng Li, examines the intersection of hardware faults and DNN inference, particularly in safety-critical applications such as autonomous vehicles and healthcare systems. As DNNs become increasingly prevalent in such domains, understanding their susceptibility to transient hardware faults becomes crucial. The authors advance fault injection techniques by enhancing the fault injector TENSORFI for TensorFlow applications, enabling scalable fault injection in modern DNN models. Through extensive experimentation and analysis, they reveal that transient hardware faults have a significant impact on safety-critical applications, one that surpasses the influence of intrinsic algorithmic inaccuracies. Their findings underscore the importance of prioritized protection for specific regions within DNNs to enhance reliability in safety-critical contexts. The second paper, 'Delta4Ms: Improving mutation-based fault localization by eliminating mutant bias' by Hengyuan Liu, Zheng Li, Baolong Han, Yangtao Liu, Xiang Chen, and Yong Liu, addresses the challenges of fault localization in software debugging. While mutation-based fault localization (MBFL) has emerged as a promising technique, it suffers from inherent biases that hinder its accuracy. The authors present Delta4Ms, a novel approach that tackles mutant bias by integrating original principles inspired by signal theory.
By distinguishing between desired and false signal components, Delta4Ms effectively mitigates mutant bias, leading to more accurate fault localization. In comprehensive evaluations on real-fault programmes, Delta4Ms performs better than existing techniques, showing significant improvements in fault localization effectiveness while minimizing computational costs. In conclusion, these two papers provide valuable insights into ensuring software reliability and resilience in two different contexts. As these works demonstrate, it is of high interest for STVR to further explore and use fault injection techniques, at both the software and hardware levels and including machine-learning models, to strengthen and validate software reliability and fault tolerance techniques.