Fair evaluation of classifier predictive performance based on binary confusion matrix

Amalia Vanacore,Maria Sole Pellegrino,Armando Ciardiello
DOI: https://doi.org/10.1007/s00180-022-01301-9
IF: 1.4049
2022-11-30
Computational Statistics
Abstract:Evaluating the ability of a classifier to make predictions on unseen data and increasing it by tweaking the learning algorithm are two of the main reasons motivating the evaluation of classifier predictive performance. In this study the behavior of Balanced  — a novel classifier accuracy measure — is investigated under different class imbalance conditions via a Monte Carlo simulation. The behavior of Balanced is compared against that of several well-known performance measures based on binary confusion matrix. Study results reveal the suitability of Balanced with both balanced and imbalanced data sets. A real example of the effects of class imbalance on the behavior of the investigated classifier performance measures is provided by comparing the performance of several machine learning algorithms in a churn prediction problem.
statistics & probability
What problem does this paper attempt to address?