Effort-Aware semi-Supervised just-in-Time defect prediction

Weiwei Li,Wenzhou Zhang,Xiuyi Jia,Zhiqiu Huang
DOI: https://doi.org/10.1016/j.infsof.2020.106364
IF: 3.9
2020-10-01
Information and Software Technology
Abstract:<p><em>Context:</em> Software defect prediction is an important technique that can help practitioners allocate their quality assurance efforts. In recent years, just-in-time (JIT) defect prediction has attracted considerable interest, as it enables developers to identify risky changes at check-in time. <em>Objective:</em> Many studies have conducted research from supervised and unsupervised perspectives. A model that does not rely on label information would be preferred. However, the performance of unsupervised models proposed by previous studies in the classification scenario was unsatisfactory due to the lack of supervised information. Furthermore, most supervised models fail to outperform simple unsupervised models in the ranking scenario. To overcome this weakness, we conduct research from the semi-supervised perspective that only requires a small quantity of labeled data for training. <em>Method:</em> In this paper, we propose a semi-supervised model for JIT defect prediction named Effort-Aware Tri-Training (EATT), which is an effort-aware method using a greedy strategy to rank changes. We compare EATT with the state-of-the-art supervised and unsupervised models with respect to different labeled rate. <em>Results:</em> The experimental results on six open-source projects demonstrate that EATT outperforms existing supervised and unsupervised models for effort-aware JIT defect prediction, and has similar or superior performance in classifying defect-inducing changes. <em>Conclusion:</em> The results show that EATT can not only achieve high classification accuracy as supervised models, but also offer more practical value than other compared models from the perspective of the effort needed to review changes.</p>
computer science, information systems, software engineering
What problem does this paper attempt to address?