Abstract: Software has become ubiquitous in our daily lives, and its increasing functionality and complexity often make debugging tedious and prolonged. Of the three activities in program debugging (failure detection, fault localization, and bug fixing), this paper focuses on the first, failure detection, under the condition that no test oracle is available to automatically determine the success or failure of every execution. More precisely, the outputs of many executions have to be verified manually, or the expected outputs are not available at all. We ask two questions: is there a way to help programmers predict the execution results, and how good are these predicted results when they are used to help programmers locate bugs? We propose a framework that reduces the effort spent on output verification by predicting the results of test executions with a strategy based on the Hamming distance or K-Means clustering. The predicted results and the statement coverage of each test case are then used to compute the suspiciousness of each statement according to a fault localization technique, producing a ranking of statements to examine when locating bugs. Case studies using 22 programs and seven fault localization techniques were conducted to evaluate the fault localization effectiveness of the proposed framework on 1203 faulty versions, some with a single bug and others with multiple bugs. We also discuss factors that may affect the accuracy of execution result prediction and the resulting fault localization effectiveness. Our data suggest that, in general, fault localization techniques using predicted execution results can be as effective as, or even more effective than, the same techniques using execution results verified against the expected outputs, where effectiveness is measured by the number of statements examined to locate the first faulty statement.
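
The pipeline the abstract describes (predict execution results, then rank statements by suspiciousness) can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes a simple prediction rule, in which a test is labeled as failed when its statement-coverage vector lies within a chosen Hamming distance of a test already known to fail. It also assumes Ochiai as the suspiciousness formula, since the abstract does not name the seven techniques studied. All function names, the `threshold` parameter, and the sample coverage matrix are hypothetical.

```python
from math import sqrt


def hamming(a, b):
    """Count the positions where two equal-length coverage vectors differ."""
    return sum(x != y for x, y in zip(a, b))


def predict_results(coverage, known_failed, threshold):
    """Predict a result for every test (True means 'failed').

    Assumed rule, for illustration only: a test is predicted to fail if its
    coverage vector is within `threshold` Hamming distance of the coverage
    vector of any test already known to fail.
    """
    failing_vectors = [coverage[i] for i in known_failed]
    return [
        any(hamming(row, f) <= threshold for f in failing_vectors)
        for row in coverage
    ]


def ochiai(coverage, failed):
    """Ochiai suspiciousness of each statement.

    Ochiai is a stand-in here; any coverage-based formula could be used.
    """
    total_failed = sum(failed)
    scores = []
    for s in range(len(coverage[0])):
        # Tests that cover statement s, split by (predicted) outcome.
        ef = sum(1 for row, f in zip(coverage, failed) if row[s] and f)
        ep = sum(1 for row, f in zip(coverage, failed) if row[s] and not f)
        denom = sqrt(total_failed * (ef + ep))
        scores.append(ef / denom if denom else 0.0)
    return scores


def rank_statements(scores):
    """Return statement indices ordered from most to least suspicious."""
    return sorted(range(len(scores)), key=lambda s: scores[s], reverse=True)


# Hypothetical data: rows are tests, columns are statements (1 = covered).
coverage = [
    [1, 1, 0, 1],
    [1, 1, 0, 1],
    [1, 0, 1, 1],
    [1, 0, 1, 0],
    [1, 1, 1, 1],
]
predicted = predict_results(coverage, known_failed=[0], threshold=1)
ranking = rank_statements(ochiai(coverage, predicted))
```

In this sketch only test 0 has a manually verified result; every other label is predicted from coverage similarity, which is the kind of saving in output verification the framework aims for. A K-Means variant would replace `predict_results` with a clustering step over the same coverage vectors.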

Does the Failing Test Execute a Single or Multiple Faults? An Approach to Classifying Failing Tests

A Combinatorial Testing-Based Approach to Fault Localization

Towards Interactive Fault Localization Using Test Information

Effective Software Fault Localization Using Predicted Execution Results

A Test Restoration Method based on Genetic Algorithm for effective fault localization in multiple-fault programs

A combined passive-active method for diagnosing multiplicative fault

Searching for Multi-Fault Programs in Defects4J

Theoretical Analysis and Empirical Study on the Impact of Coincidental Correct Test Cases in Multiple Fault Localization

A Study of Modified Testing-Based Fault Localization Method

Fault Localization Based on Multi-Level Similarity of Execution Traces

A Test Suite Reduction Approach to Improving the Effectiveness of Fault Localization

An Empirical Study of Fault Localization Families and Their Combinations

Test case selection using multi-criteria optimization for effective fault localization

A Systematic Study of Failure Proximity

CFaults: Model-Based Diagnosis for Fault Localization in C Programs with Multiple Test Cases

A Comprehensive Empirical Investigation on Failure Clustering in Parallel Debugging

Test Case Prioritization Approach to Improving the Effectiveness of Fault Localization

On similarity-awareness in testing-based fault localization

New Random Testing-based Fault Localization Approach

A Survey of Automated Software Fault Localization Approach

Identifying Failure-Causing Schemas in the Presence of Multiple Faults