Abstract:Fault localization is a process that aims to identify the potentially faulty statements responsible for program failures by analyzing runtime information. Therefore, the input code coverage matrix plays a crucial role in FL. However, the effectiveness of fault localization is compromised by the presence of coincidental correct test cases (CCTC) in the coverage matrix. These CCTC execute faulty code but do not result in program failures. To address this issue, many existing methods focus on identifying CCTC through cluster analysis. However, these methods have three problems. Firstly, identifying the optimal cluster count poses a considerable challenge in CCTC detection. Secondly, the effectiveness of CCTC detection is heavily influenced by the initial centroid selection. Thirdly, the presence of abundant fault-irrelevant statements within the raw coverage matrix introduces substantial noise for CCTC detection. To overcome these challenges, we propose SCD4FL: a semantic context-based CCTC detection method to enhance the coverage matrix for fault localization. SCD4FL incorporates and implements two key ideas: (1) SCD4FL uses the intersection of execution slices to construct a semantic context from the raw coverage matrix, effectively reducing noise during CCTC detection. (2) SCD4FL employs an expert-knowledge-based K-nearest neighbors (KNN) algorithm to detect the CCTC, effectively eliminating the requirement of determining the cluster number and initial centroid. To evaluate the effectiveness of SCD4FL, we conducted extensive experiments on 420 faulty versions of nine benchmarks using six state-of-the-art fault localization methods and two representative CCTC detection methods. The experimental results validate the effectiveness of our method in enhancing the performance of the six fault localization methods and two CCTC detection methods, e.g., the RNN method can be improved by 53.09% under the MFR metric.

A coincidental correctness test case identification framework with fuzzy C-means clustering

Identifying Coincidental Correctness for Fault Localization by Clustering Test Cases

A Clustering-Based Strategy to Identify Coincidental Correctness in Fault Localization.

Semantic context based coincidental correct test cases detection for fault localization

Theoretical Analysis and Empirical Study on the Impact of Coincidental Correct Test Cases in Multiple Fault Localization

A Fault-Localization Approach Based on the Coincidental Correctness Probability

A Dynamic Fault Localization Technique with Noise Reduction for Java Programs

A General Noise-Reduction Framework for Fault Localization of Java Programs.

Theoretical Analysis on Fault Localization Formulas by Coincidental Correctness

Defect Detection and Identification in Eddy Current Testing Using Subtractive Clustering Algorithm Combined with Rbfnn

Improving Test Distance for Failure Clustering with Hypergraph Modelling

On similarity-awareness in testing-based fault localization

Test Adequacy Criterion Based on Coincidental Correctness Probability

A similarity-aware approach to testing based fault localization.

HuntFUZZ: Enhancing Error Handling Testing through Clustering Based Fuzzing

A possibilistic Fuzzy c-means algorithm based on improved Cuckoo search for data clustering

Adaptive Approach to Fuzzy Clustering

A Test Restoration Method based on Genetic Algorithm for effective fault localization in multiple-fault programs

A Study of Modified Testing-Based Fault Localization Method

A Combinatorial Testing-Based Approach to Fault Localization

Using Weighted Attributes to Improve Cluster Test Selection