Decision Making with Machine Learning and ROC Curves

Kai Feng, Han Hong, Ke Tang, Jingyuan Wang
DOI: https://doi.org/10.48550/arXiv.1905.02810
2019-05-05
Methodology
Abstract: The Receiver Operating Characteristic (ROC) curve is a representation of the statistical information discovered in binary classification problems and is a key concept in machine learning and data science. This paper studies the statistical properties of ROC curves and their implications for model selection. We analyze how different models of incentive heterogeneity and information asymmetry affect the relation between human decisions and ROC curves. Our theoretical discussion is illustrated with a large data set of pregnancy outcomes and doctors' diagnoses from the Pre-Pregnancy Checkups of reproductive-age couples in Henan Province, provided by the Chinese Ministry of Health.
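For readers unfamiliar with the object the abstract refers to, the following is a minimal sketch of how an empirical ROC curve can be traced from classifier scores and binary outcomes. It is not the paper's method or data: the function names `roc_points` and `auc` and the six-observation toy sample are hypothetical, chosen only for illustration. Each point on the curve is the (false positive rate, true positive rate) pair obtained by classifying as positive every observation whose score is at or above a given threshold.

```python
def roc_points(labels, scores):
    """Return the empirical ROC curve as a list of (FPR, TPR) points.

    labels: iterable of 0/1 outcomes (1 = positive).
    scores: iterable of classifier scores; higher means more likely positive.
    """
    # Sort observations from highest to lowest score.
    pairs = sorted(zip(scores, labels), key=lambda p: -p[0])
    n_pos = sum(1 for _, y in pairs if y == 1)
    n_neg = len(pairs) - n_pos
    if n_pos == 0 or n_neg == 0:
        raise ValueError("need at least one positive and one negative label")

    points = [(0.0, 0.0)]  # threshold above every score: nothing flagged
    tp = fp = 0
    i = 0
    while i < len(pairs):
        threshold = pairs[i][0]
        # Observations tied at the same score enter together.
        while i < len(pairs) and pairs[i][0] == threshold:
            if pairs[i][1] == 1:
                tp += 1
            else:
                fp += 1
            i += 1
        points.append((fp / n_neg, tp / n_pos))
    return points


def auc(points):
    """Area under the ROC curve by the trapezoidal rule."""
    return sum((x1 - x0) * (y1 + y0) / 2.0
               for (x0, y0), (x1, y1) in zip(points, points[1:]))


# Hypothetical toy data, not drawn from the paper's data set.
labels = [1, 1, 0, 1, 0, 0]
scores = [0.9, 0.8, 0.7, 0.6, 0.4, 0.2]
curve = roc_points(labels, scores)
print(curve)
print(auc(curve))
```

On this toy sample the area under the curve equals 8/9, which matches the share of (positive, negative) pairs in which the positive observation receives the higher score.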