Abstract:Test smell refers to poor programming and design practices in testing and widely spreads throughout software projects. Considering test smells have negative impacts on the comprehension and maintenance of test code and even make code-under-test more defect-prone, it thus has great importance in mining, detecting, and refactoring them. Since Deursen et al. introduced the definition of “test smell”, several studies worked on discovering new test smells from test specifications and software practitioners’ experience. Indeed, many bad testing practices are “observed” by software developers during creating test scripts rather than through academic research and are widely discussed in the software engineering community (e.g., Stack Overflow) [ 70 , 94 ]. However, no prior studies explored new bad testing practices from software practitioners’ discussions, formally defined them as new test smell types, and analyzed their characteristics, which plays a bad role for developers in knowing these bad practices and avoiding using them during test code development. Therefore, we pick up those challenges and act by working on systematic methods to explore new test smell types from one of the most mainstream developers’ Q&A platforms, i.e., Stack Overflow. We further investigate the harmfulness of new test smells and analyze possible solutions for eliminating them. We find that some test smells make it hard for developers to fix failed test cases and trace their failing reasons. To exacerbate matters, we have identified two types of test smells that pose a risk to the accuracy of test cases. Next, we develop a detector to detect test smells from software. The detector is composed of six detection methods for different smell types. These detection methods are both wrapped with a set of syntactic rules based on the code patterns extracted from different test smells and developers’ code styles. We manually construct a test smell dataset from seven popular Java projects and evaluate the effectiveness of our detector on it. The experimental results show that our detector achieves high performance in precision, recall, and F1 score. Then, we utilize our detector to detect smells from 919 real-world Java projects to explore whether the six test smells are prevalent in practice. We observe that these test smells are widely spread in 722 out of 919 Java projects, which demonstrates that they are prevalent in real-world projects. Finally, to validate the usefulness of test smells in practice, we submit 56 issue reports to 53 real-world projects with different smells. Our issue reports achieve 76.4% acceptance by conducting sentiment analysis on developers’ replies. These evaluations confirm the effectiveness of our detector and the prevalence and practicality of new test smell types on real-world projects.

A Large-Scale Empirical Study of Actionable Warning Distribution Within Projects

AW4C: A Commit-Aware C Dataset for Actionable Warning Identification

ACWRecommender: A Tool for Validating Actionable Warnings with Weak Supervision

How to Find Actionable Static Analysis Warnings: A Case Study with FindBugs

Machine Learning for Actionable Warning Identification: A Comprehensive Survey

An Unsupervised Feature Selection Approach for Actionable Warning Identification.

Pre-trained Model-based Actionable Warning Identification: A Feasibility Study

The Lost World: Characterizing and Detecting Undiscovered Test Smells.

Improving actionable warning identification via the refined warning-inducing context representation

Automatic Construction of an Effective Training Set for Prioritizing Static Analysis Warnings.

An Empirical Study of Class Rebalancing Methods for Actionable Warning Identification

A Study of Static Warning Cascading Tools (Experience Paper)

SATD Detector

Automatically Inspecting Thousands of Static Bug Warnings with Large Language Model: How Far Are We?

A longitudinal study of static analysis warning evolution and the effects of PMD on software quality in Apache open source projects

Automated Unearthing of Dangerous Issue Reports.

Learning to Recognize Actionable Static Code Warnings (is Intrinsically Easy)

FineWAVE: Fine-Grained Warning Verification of Bugs for Automated Static Analysis Tools

How developers engage with static analysis tools in different contexts

Automated Static Warning Identification via Path-based Semantic Representation