Strengthening Machine Learning Reproducibility for Image Classification

Guofan Shao, Hao Zhang, Jinyuan Shao, Keith Woeste, Lina Tang
DOI: https://doi.org/10.54364/aaiml.2022.1132
2022-01-01
Advances in Artificial Intelligence and Machine Learning
Abstract: Machine learning (ML) reproducibility needs to be informed by reliable evaluation measures. However, image classification is routinely evaluated using metrics that are highly sensitive to class prevalence. Consequently, the reproducibility of ML models remains unclear because of noise induced by class imbalance. We suggest regularly using class imbalance-resistant evaluation metrics, including balanced accuracy, area under the precision-recall curve, and image classification efficacy, to evaluate the reproducibility of ML models. Each of these metrics is conceptually consistent with and logically complements the others, and their joint use can help explain different aspects of classification performance at both the whole-class level and the individual-class level. These metrics can be used for the validation, testing, and/or transfer of ML classifiers. Routine, comprehensive analysis using these metrics strengthens the reproducibility of ML models.
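To make the prevalence sensitivity described in the abstract concrete, the following minimal sketch compares plain accuracy with balanced accuracy on a hypothetical imbalanced test set, and computes a step-wise area under the precision-recall curve (average precision) for one class. The function names, class labels, and toy data are illustrative assumptions, not the authors' code. Image classification efficacy is not implemented here because the abstract does not give its formula.

```python
from collections import defaultdict


def accuracy(y_true, y_pred):
    """Fraction of samples labeled correctly; sensitive to class prevalence."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def per_class_recall(y_true, y_pred):
    """Recall of each reference class, computed independently of class size."""
    total = defaultdict(int)
    correct = defaultdict(int)
    for t, p in zip(y_true, y_pred):
        total[t] += 1
        if t == p:
            correct[t] += 1
    return {c: correct[c] / total[c] for c in total}


def balanced_accuracy(y_true, y_pred):
    """Unweighted mean of per-class recall, so every class counts equally."""
    recalls = per_class_recall(y_true, y_pred)
    return sum(recalls.values()) / len(recalls)


def average_precision(y_true, scores, positive=1):
    """Step-wise area under the precision-recall curve for one class.

    Assumes at least one positive sample; tied scores keep input order.
    """
    ranked = sorted(zip(scores, y_true), key=lambda pair: -pair[0])
    n_pos = sum(1 for t in y_true if t == positive)
    hits = 0
    area = 0.0
    for rank, (_, t) in enumerate(ranked, start=1):
        if t == positive:
            hits += 1
            area += hits / rank  # precision at each recalled positive
    return area / n_pos


# Hypothetical test set: 90 "background" images and 10 "target" images,
# scored against a classifier that always predicts the majority class.
y_true = ["background"] * 90 + ["target"] * 10
y_pred = ["background"] * 100

acc = accuracy(y_true, y_pred)
bal_acc = balanced_accuracy(y_true, y_pred)
recalls = per_class_recall(y_true, y_pred)

# Hypothetical binary scores for the precision-recall example.
ap = average_precision([1, 0, 1, 0], [0.9, 0.8, 0.7, 0.1])
```

In this toy case the majority-class predictor reaches an accuracy of 0.9 while its balanced accuracy is only 0.5, and the per-class recall shows that the "target" class is never detected. This is the kind of whole-class versus individual-class view the abstract recommends reporting together.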