Abstract: Identifying the location of faults effectively and accurately is highly important in software debugging. Coverage-based Fault Localization (CBFL), which has been widely studied, reduces the effort developers spend finding fault positions by using the execution information of test cases. Coincidental Correct (CC) test cases are test cases that execute the faulty statements but still produce correct output, and they have been shown to have a negative effect on the accuracy of CBFL. In this paper, we propose a weighted fuzzy classification approach to identify CC test cases, and we suggest three fuzzy strategies to manipulate CC test cases for CBFL. First, we present a simple but efficient approach to identify some CC test cases for single-fault programs, which provides labeled samples that enable the application of supervised classification algorithms to CC identification. Then, a Fuzzy Weighted K-Nearest Neighbor (FW-KNN) algorithm is proposed to classify potential CC test cases among the passed test cases, in which a "weighted" similarity measure and a "weighted" CC probability computation are presented. Finally, three fuzzy CC test case manipulation strategies are presented to mitigate the impact of CC test cases on CBFL. Empirical studies are conducted on 190 faulty versions of 12 programs to investigate the impact of the "weighted" and "fuzzy" methods on CC identification by comparing the effectiveness and efficiency of FW-KNN with those of three popular clustering and classification techniques. The results indicate that the proposed FW-KNN has higher accuracy and lower time cost. The Precision, Recall, and False Positive Rate of FW-KNN are 96.47%, 83.40%, and 2.85%, respectively. Moreover, utilizing code block coverage reduces the time cost by 72.97% on average compared to statement coverage. The experimental results also indicate that the fault localization accuracy of CBFL can be improved by the proposed CC test case manipulation strategies. (C) 2019 Elsevier Inc. All rights reserved.
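
The abstract does not give the FW-KNN formulas, so the following Python sketch only illustrates the general idea of a weighted, fuzzy nearest-neighbor estimate of CC probability for a passed test case. Everything specific in it is an assumption made for illustration: the weighted Jaccard-style similarity, the per-statement `weights`, the value of `k`, and the function names `weighted_similarity` and `cc_probability` are hypothetical and are not taken from the paper.

```python
def weighted_similarity(a, b, weights):
    """Weighted Jaccard-style similarity of two binary coverage vectors.

    Assumption: each statement carries a weight (for example, a
    suspiciousness score), so agreement on heavily weighted statements
    counts for more than agreement on lightly weighted ones.
    """
    shared = sum(w for x, y, w in zip(a, b, weights) if x and y)
    either = sum(w for x, y, w in zip(a, b, weights) if x or y)
    return shared / either if either else 0.0


def cc_probability(candidate, labeled, weights, k=3):
    """Fuzzy CC membership of a passed test case.

    `labeled` is a list of (coverage_vector, is_cc) pairs. The result is
    the similarity-weighted share of CC labels among the k most similar
    labeled test cases, a value in [0, 1] rather than a hard class.
    """
    scored = sorted(
        ((weighted_similarity(candidate, cov, weights), is_cc)
         for cov, is_cc in labeled),
        key=lambda pair: pair[0],
        reverse=True,
    )[:k]
    total = sum(sim for sim, _ in scored)
    if total == 0:
        return 0.0
    return sum(sim for sim, is_cc in scored if is_cc) / total


# Hypothetical data: four statements, four labeled passed test cases.
weights = [0.1, 0.9, 0.5, 0.2]
labeled = [
    ([1, 1, 0, 0], True),
    ([0, 1, 1, 0], True),
    ([1, 0, 0, 1], False),
    ([0, 0, 1, 1], False),
]

likely_cc = cc_probability([1, 1, 1, 0], labeled, weights)
unlikely_cc = cc_probability([1, 0, 1, 1], labeled, weights)
```

Returning a probability instead of a hard label is what would let a downstream step treat a test case as partially CC, for example by discounting its contribution to a suspiciousness formula in proportion to that probability. How the paper's three manipulation strategies actually use the value is not stated in the abstract.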

Combining Coverage and Expert Features with Semantic Representation for Coincidental Correctness Detection

Unifying Defect Prediction, Categorization, and Repair by Multi-Task Deep Learning

Towards More Precise Coincidental Correctness Detection with Deep Semantic Learning

NeuralCCD: Integrating Multiple Features for Neural Coincidental Correctness Detection

Contrastive Coincidental Correctness Representation Learning

Semantic context based coincidental correct test cases detection for fault localization

Identifying Coincidental Correct Test Cases with Multiple Features Extraction for Fault Localization

Machine Learning Driven Identification of Coincidental Correct Test Cases

Theoretical Analysis and Empirical Study on the Impact of Coincidental Correct Test Cases in Multiple Fault Localization

A Fault-Localization Approach Based on the Coincidental Correctness Probability

A Weighted Fuzzy Classification Approach to Identify and Manipulate Coincidental Correct Test Cases for Fault Localization.

MCFL: Improving Fault Localization by Differentiating Missing Code and Other Faults

Improving MC/DC and Fault Detection Strength Using Combinatorial Testing.

Combined Classifier for Cross-Project Defect Prediction: an Extended Empirical Study.

Identify Coincidental Correct Test Cases Based on Fuzzy Classification

A Study of Enhanced MC/DC Coverage Criterion for Software Testing.

A Clustering-Based Strategy to Identify Coincidental Correctness in Fault Localization.

A Weight-based Approach to Combinatorial Test Generation

Coverage-enhanced fault diagnosis for Deep Learning programs: A learning-based approach with hybrid metrics

Regression Identification of Coincidental Correctness via Weighted Clustering

A fault localization method based on model combination