Abstract:Test smell refers to poor programming and design practices in testing and widely spreads throughout software projects. Considering test smells have negative impacts on the comprehension and maintenance of test code and even make code-under-test more defect-prone, it thus has great importance in mining, detecting, and refactoring them. Since Deursen et al. introduced the definition of “test smell”, several studies worked on discovering new test smells from test specifications and software practitioners’ experience. Indeed, many bad testing practices are “observed” by software developers during creating test scripts rather than through academic research and are widely discussed in the software engineering community (e.g., Stack Overflow) [ 70 , 94 ]. However, no prior studies explored new bad testing practices from software practitioners’ discussions, formally defined them as new test smell types, and analyzed their characteristics, which plays a bad role for developers in knowing these bad practices and avoiding using them during test code development. Therefore, we pick up those challenges and act by working on systematic methods to explore new test smell types from one of the most mainstream developers’ Q&A platforms, i.e., Stack Overflow. We further investigate the harmfulness of new test smells and analyze possible solutions for eliminating them. We find that some test smells make it hard for developers to fix failed test cases and trace their failing reasons. To exacerbate matters, we have identified two types of test smells that pose a risk to the accuracy of test cases. Next, we develop a detector to detect test smells from software. The detector is composed of six detection methods for different smell types. These detection methods are both wrapped with a set of syntactic rules based on the code patterns extracted from different test smells and developers’ code styles. We manually construct a test smell dataset from seven popular Java projects and evaluate the effectiveness of our detector on it. The experimental results show that our detector achieves high performance in precision, recall, and F1 score. Then, we utilize our detector to detect smells from 919 real-world Java projects to explore whether the six test smells are prevalent in practice. We observe that these test smells are widely spread in 722 out of 919 Java projects, which demonstrates that they are prevalent in real-world projects. Finally, to validate the usefulness of test smells in practice, we submit 56 issue reports to 53 real-world projects with different smells. Our issue reports achieve 76.4% acceptance by conducting sentiment analysis on developers’ replies. These evaluations confirm the effectiveness of our detector and the prevalence and practicality of new test smell types on real-world projects.

An Empirical Study of False Negatives and Positives of Static Code Analyzers From the Perspective of Historical Issues

The Lost World: Characterizing and Detecting Undiscovered Test Smells.

Mitigating False Positive Static Analysis Warnings: Progress, Challenges, and Opportunities

"Automated Debugging Considered Harmful" Considered Harmful A User Study Revisiting the Usefulness of Spectra-Based Fault Localization Techniques with Professionals Using Real Bugs from Large Systems

Find bugs in static bug finders

Statfier: Automated Testing of Static Analyzers Via Semantic-Preserving Program Transformations

Understanding and Detecting Annotation-Induced Faults of Static Analyzers

An Approach to Detecting Bugs in Pattern-Based Bug Detectors

Finding and Understanding Defects in Static Analyzers by Constructing Automated Oracles

Towards Understanding Fixes of SonarQube Static Analysis Violations: A Large-Scale Empirical Study

Automated Program-Semantic Defect Repair and False-Positive Elimination without Side Effects

Mitigating Access Control Vulnerabilities through Interactive Static Analysis.

Which Defect Should Be Fixed First? Semantic Prioritization of Static Analysis Report.

"False negative -- that one is going to kill you": Understanding Industry Perspectives of Static Analysis based Security Testing

False Positive Elimination in Suspected Code Fault Automatic Confirmation

Static Analyzers and Potential Future Research Directions for Scala: An Overview

Validating Static Warnings via Testing Code Fragments

Evaluating C/C++ Vulnerability Detectability of Query-Based Static Application Security Testing Tools

Reducing False Positives of Static Bug Detectors Through Code Representation Learning

Canalyze: a Static Bug-Finding Tool for C Programs.

Efficacy of static analysis tools for software defect detection on open-source projects