Abstract:In this edition, we present two papers that explore complementary aspects of software reliability, both leveraging different fault injection techniques, shedding light on hardware faults' impact on deep learning neural networks (DNNs) and the enhancement of mutation-based fault localization techniques. The first paper, titled 'Investigating the impact of transient hardware faults on deep learning neural network inference' by Md Hasanur Rahman, Sabuj Laskar, and Guanpeng Li, examines the intersection of hardware faults and DNN inference, particularly in safety-critical applications like autonomous vehicles and healthcare systems. As DNNs become increasingly prevalent in such domains, understanding their susceptibility to transient hardware faults becomes crucial. The authors introduce advancements in fault injection techniques, by enhancing the fault injector, TENSORFI, for TensorFlow applications, enabling scalable fault injections in modern DNN models. Through extensive experimentation and analysis, they reveal the significant impact of transient hardware faults on safety-critical applications, surpassing the influence of intrinsic algorithmic inaccuracies. Their findings underscore the importance of prioritized protection for specific regions within DNNs to enhance reliability in safety-critical contexts. The second paper, 'Delta4Ms: Improving mutation-based fault localization by eliminating mutant bias' authored by Hengyuan Liu, Zheng Li, Baolong Han, Yangtao Liu, Xiang Chen, and Yong Liu, addresses the challenges of fault localization in software debugging. While mutation-based fault localization (MBFL) has emerged as a promising technique, it suffers from inherent biases, hindering its accuracy. The authors present Delta4Ms, a novel approach that tackles mutant bias by integrating original principles inspired from signal theory. By distinguishing between desired and false signal components, Delta4Ms effectively mitigates mutant bias, leading to more accurate fault localization. Through comprehensive evaluations on real-fault programmes, Delta4Ms demonstrates increased performance compared to existing techniques, showcasing significant improvements in fault localization effectiveness while minimizing computational costs. In conclusion, these two papers provide valuable insights into ensuring software reliability and resilience in two different contexts. As demonstrated by these works, it is of high interest for STVR to further explore and use fault injection techniques, at software and hardware levels including machine-learning models, to strengthen and validate software reliability and fault tolerance techniques.

NV-DNN: Towards Fault-Tolerant DNN Systems with N-Version Programming

Improving Fault Tolerance for Reliable DNN Using Boundary-Aware Activation

There is Limited Correlation Between Coverage and Robustness for Deep Neural Networks

Reliable Memristor-based Neuromorphic Design Using Variation- and Defect-Aware Training

Enhancing Fault Resilience of QNNs by Selective Neuron Splitting

Improving Model Robustness Against Adversarial Examples with Redundant Fully Connected Layer.

Towards Enhancing Fault Tolerance in Neural Networks

Reliability Assurance for Deep Neural Network Architectures Against Numerical Defects

Estimation of Small Failure Probability Based on Adaptive Subset Simulation and Deep Neural Network

DeepVigor+: Scalable and Accurate Semi-Analytical Fault Resilience Analysis for Deep Neural Network

A Redundant Fault-tolerant Aviation Control System Based on Deep Neural Network

Fault Tolerance Based on Neural Networks for the Intelligent Distributed Framework

Hybrid Convolutional Neural Networks with Reliability Guarantee

Enhancing Neural Network Robustness Against Fault Injection Through Non-linear Weight Transformations

Evaluating Deep Neural Networks in Deployment (A Comparative and Replicability Study)

Investigating Fault Injection Techniques in Hardware-Based Deep Neural Networks and Mutation-Based Fault Localization

Dependable Neural Networks for Safety Critical Tasks

Fail-Safe Execution of Deep Learning based Systems through Uncertainty Monitoring

Automated design of error-resilient and hardware-efficient deep neural networks

Fault-Tolerant Neural Networks from Biological Error Correction Codes

Efficient Error-Tolerant Quantized Neural Network Accelerators