Abstract:Background. Self-admitted technical debt (SATD) is a special kind of technical debt that is intentionally introduced and remarked by code comments. Those technical debts reduce the quality of software and increase the cost of subsequent software maintenance. Therefore, it is necessary to find out and resolve these debts in time. Recently, many automatic approaches have been proposed to identify SATD. Problem. Popular IDEs support a number of predefined task annotation tags for indicating SATD in comments, which have been used in many projects. However, such clear prior knowledge is neglected by existing SATD identification approaches when identifying SATD. Objective. We aim to investigate how far we have really progressed in the field of SATD identification by comparing existing approaches with a simple approach that leverages the predefined task tags to identify SATD. Method. We first propose a simple heuristic approach that fuzzily Matches task Annotation Tags ( MAT ) in comments to identify SATD. In nature, MAT is an unsupervised approach, which does not need any data to train a prediction model and has a good understandability. Then, we examine the real progress in SATD identification by comparing MAT against existing approaches. Result. The experimental results reveal that: (1) MAT has a similar or even superior performance for SATD identification compared with existing approaches, regardless of whether non-effort-aware or effort-aware evaluation indicators are considered; (2) the SATDs (or non-SATDs) correctly identified by existing approaches are highly overlapped with those identified by MAT ; and (3) supervised approaches misclassify many SATDs marked with task tags as non-SATDs, which can be easily corrected by their combinations with MAT . Conclusion. It appears that the problem of SATD identification has been (unintentionally) complicated by our community, i.e., the real progress in SATD comments identification is not being achieved as it might have been envisaged. We hence suggest that, when many task tags are used in the comments of a target project, future SATD identification studies should use MAT as an easy-to-implement baseline to demonstrate the usefulness of any newly proposed approach.

Prevalence, Contents and Automatic Detection of KL-SATD

SATD Detector

Neural Network-based Detection of Self-Admitted Technical Debt

Self-Admitted Technical Debts Identification: How Far Are We?

DebtFree: Minimizing Labeling Cost in Self-Admitted Technical Debt Identification using Semi-Supervised Learning

How Far Have We Progressed in Identifying Self-admitted Technical Debts? A Comprehensive Empirical Study.

An Empirical Study of Self-Admitted Technical Debt in Machine Learning Software

Quantifying and Characterizing Clones of Self-Admitted Technical Debt in Build Systems

SATDAUG -- A Balanced and Augmented Dataset for Detecting Self-Admitted Technical Debt

Improving the detection of technical debt in Java source code with an enriched dataset

Automating Just-In-Time Comment Updating

Just-In-Time Obsolete Comment Detection and Update.

Identifying self-admitted technical debt in open source projects using text mining

MAT: A Simple Yet Strong Baseline for Identifying Self-Admitted Technical Debt

Development and Adoption of SATD Detection Tools: A State-of-practice Report

A Taxonomy of Self-Admitted Technical Debt in Deep Learning Systems

Self‐admitted Technical Debt Detection by Learning Its Comprehensive Semantics Via Graph Neural Networks

Automated Detection of Algorithm Debt in Deep Learning Frameworks: An Empirical Study

A framework for conditional statement technical debt identification and description

Deep Learning and Data Augmentation for Detecting Self-Admitted Technical Debt

Self-Admitted Technical Debt Detection Approaches: A Decade Systematic Review