Over-sampling method for tackling class imbalance in software defect prediction based on generative adversarial networks

Zhang Hengwei,Jia Xiuyi
DOI: https://doi.org/10.14177/j.cnki.32-1397n.2023.47.02.006
2023-01-01
Abstract:In order to solve the problem of class imbalance in software defect prediction(SDP),this paper proposes a novel oversampling method based on generative adversarial networks(GAN). The GAN network can make full use of the spatial relationship of the sample distribution, dig out some related information hidden between the samples, and make the generated new samples more reasonable and scientific through the alternate optimization of the generator and the discriminator. This paper conductes experiments on 26 unbalanced data sets and extensive comparisons with some oversampling methods and unsampled methods on 8 classifiers. Experimental results show that this method achieves the best results on Precision, Recall, F-measure and G-mean.
What problem does this paper attempt to address?