The curious case of the test set AUROC

Michael Roberts, Alon Hazan, Sören Dittmer, James H. F. Rudd, Carola-Bibiane Schönlieb
DOI: https://doi.org/10.1038/s42256-024-00817-7
IF: 23.8
2024-04-05
Nature Machine Intelligence
Abstract: The area under the receiver operating characteristic curve (AUROC) of the test set is used throughout machine learning (ML) for assessing a model's performance. However, when concordance is not the only ambition, this gives only a partial insight into performance, masking distribution shifts of model outputs and model instability.
computer science, artificial intelligence, interdisciplinary applications
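
The abstract's point that AUROC captures only concordance can be illustrated with a small sketch. This is not code from the paper: the synthetic labels, the scores, the `auroc` helper and the 0.5 decision threshold are all assumptions made for illustration. Because AUROC depends only on how the scores rank positives against negatives, squeezing every model output toward zero leaves it unchanged, even though the output distribution has shifted enough that a fixed threshold no longer flags any case.

```python
import random


def auroc(labels, scores):
    """AUROC as the probability that a randomly chosen positive
    outscores a randomly chosen negative (ties count as one half)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Synthetic data (assumption): positives tend to receive higher scores.
rng = random.Random(0)
labels = [rng.randint(0, 1) for _ in range(200)]
scores = [min(max(rng.gauss(0.6 if y else 0.4, 0.15), 0.0), 1.0) for y in labels]

# A shifted output distribution: every score divided by 4.
# The ranking of the scores is untouched, so concordance is untouched.
shifted = [s / 4 for s in scores]

print("AUROC, original scores:", auroc(labels, scores))
print("AUROC, shifted scores: ", auroc(labels, shifted))

# At a fixed threshold of 0.5 (assumption) the two models behave very differently.
print("flagged at 0.5, original:", sum(s >= 0.5 for s in scores))
print("flagged at 0.5, shifted: ", sum(s >= 0.5 for s in shifted))
```

The two AUROC values printed are identical, while the number of cases flagged at the threshold drops to zero for the shifted scores. That is the kind of change in model outputs that a test set AUROC alone would not reveal.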