Tuning model parameters in class‐imbalanced learning with precision‐recall curve

Guang‐Hui Fu,Lun‐Zhao Yi,Jianxin Pan
DOI: https://doi.org/10.1002/bimj.201800148
IF: 1.715
2018-12-12
Biometrical Journal
Abstract:An issue for class-imbalanced learning is what assessment metric should be employed. So far, precision-recall curve (PRC) as a metric is rarely used in practice as compared with its alternative of receiver operating characteristic (ROC). This study investigates the performance of PRC as the evaluating criterion to address the class-imbalanced data and focuses on the comparison of PRC with ROC. The advantages of PRC over ROC on assessing class-imbalanced data are also investigated and tested on our proposed algorithm by tuning the whole model parameters in simulation studies and real data examples. The result shows that PRC is competitive with ROC as performance measurement for handling class-imbalanced data in tuning the model parameters. PRC can be considered as an alternative but effective assessment for preprocessing (such as variable selection) skewed data and building a classifier in class-imbalanced learning.
mathematical & computational biology,statistics & probability
What problem does this paper attempt to address?