Effort-Aware Tri-Training for Semi-supervised Just-in-Time Defect Prediction.

Wenzhou Zhang, Weiwei Li, Xiuyi Jia
DOI: https://doi.org/10.1007/978-3-030-16145-3_23
2019-01-01
Abstract: In recent years, just-in-time (JIT) defect prediction has gained considerable interest because it enables developers to identify risky changes at check-in time. Previous studies have approached the problem from both supervised and unsupervised perspectives. Since labels for changes are hard to acquire, a prediction model that does not rely heavily on label information is more desirable in practice. However, the unsupervised models proposed in previous work perform poorly in terms of precision and F1 because they lack supervised information. To overcome this weakness, we study JIT defect prediction from the semi-supervised perspective, which requires only a small amount of labeled data for training. In this paper, we propose Effort-Aware Tri-Training (EATT), a semi-supervised model for JIT defect prediction based on sample selection. We compare EATT with state-of-the-art supervised and unsupervised models at different labeled rates. The experimental results on six open-source projects demonstrate that EATT performs better than existing supervised and unsupervised models for effort-aware JIT defect prediction.