FSDNP: Feature Selection Method for Software Defect Number Prediction

Yefei LI, Guofei GUAN, Chonghui GE, Xiang CHEN, Chao NI, Zhuzhong QIAN
DOI: https://doi.org/10.3778/j.issn.1002-8331.1811-0103
2019-01-01
Abstract: Previous work in software defect prediction has mainly focused on classifying software modules as defect-prone or not. How to predict the number of defects in a software module has rarely been investigated. To address this issue, the paper proposes FSDNP, a two-stage feature selection approach for software defect number prediction that consists of a feature clustering phase and a feature selection phase. The feature clustering phase groups highly correlated features using a density-based clustering method, and the feature selection phase removes irrelevant and redundant features from each cluster using three heuristic ranking strategies. FSDNP is compared with six state-of-the-art baseline approaches on the PROMISE dataset, using average absolute error and average relative error as evaluation measures. The results show that FSDNP can effectively remove irrelevant and redundant features and build an effective software defect number prediction model.
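The two phases and the two evaluation measures named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation, and several details are assumptions the abstract does not state: the distance between two features is taken as 1 - |Pearson r|, the clustering is a simplified DBSCAN-style procedure with hypothetical parameters `eps` and `min_pts`, the three heuristic ranking strategies are replaced by a single stand-in (absolute correlation with the defect count), and average relative error uses the form |predicted - actual| / (actual + 1), which is one common definition in defect number prediction. All function names are hypothetical.

```python
import math


def pearson(x, y):
    """Pearson correlation of two equal-length lists (0.0 if either is constant)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    if sx == 0 or sy == 0:
        return 0.0
    return cov / (sx * sy)


def cluster_features(columns, eps=0.3, min_pts=2):
    """Phase 1 (assumed form): DBSCAN-style clustering of feature columns.

    Distance between two features is 1 - |Pearson r|, so highly correlated
    features end up in the same cluster. Features that join no dense
    cluster are kept as singleton clusters.
    """
    m = len(columns)
    dist = [[1 - abs(pearson(columns[i], columns[j])) for j in range(m)]
            for i in range(m)]
    neighbors = [[j for j in range(m) if dist[i][j] <= eps] for i in range(m)]
    labels = [None] * m
    clusters = []
    for i in range(m):
        if labels[i] is not None or len(neighbors[i]) < min_pts:
            continue  # already assigned, or not a core feature
        cid = len(clusters)
        clusters.append([])
        stack = [i]
        while stack:
            p = stack.pop()
            if labels[p] is not None:
                continue
            labels[p] = cid
            clusters[cid].append(p)
            if len(neighbors[p]) >= min_pts:  # expand only from core features
                stack.extend(q for q in neighbors[p] if labels[q] is None)
    for i in range(m):
        if labels[i] is None:  # unclustered feature becomes its own cluster
            labels[i] = len(clusters)
            clusters.append([i])
    return clusters


def select_features(columns, defect_counts, clusters, top_k=1):
    """Phase 2 (stand-in ranking): keep the top_k features of each cluster,
    ranked by absolute correlation with the defect count."""
    relevance = [abs(pearson(col, defect_counts)) for col in columns]
    selected = []
    for cluster in clusters:
        ranked = sorted(cluster, key=lambda j: -relevance[j])
        selected.extend(ranked[:top_k])
    return sorted(selected)


def average_absolute_error(actual, predicted):
    """Mean of |predicted - actual| over all modules."""
    return sum(abs(p - a) for a, p in zip(actual, predicted)) / len(actual)


def average_relative_error(actual, predicted):
    """Mean of |predicted - actual| / (actual + 1); the +1 avoids division
    by zero for defect-free modules (assumed definition)."""
    return sum(abs(p - a) / (a + 1) for a, p in zip(actual, predicted)) / len(actual)


# Toy data: f0 and f1 are perfectly correlated, f2 is nearly uncorrelated with both.
features = [
    [1, 2, 3, 4, 5, 6],
    [2, 4, 6, 8, 10, 12],
    [5, 1, 4, 2, 6, 3],
]
defects = [0, 1, 1, 2, 2, 3]
clusters = cluster_features(features)
selected = select_features(features, defects, clusters)
```

In this toy example the two perfectly correlated features fall into one cluster and only one of them is retained, while the third feature forms its own cluster and is kept. A regression model would then be trained on the selected features only and scored with the two error measures.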