Statistical Thinking, Machine Learning

Jiang Bian,Iain Buchan,Yi Guo,Mattia Prosperi
DOI: https://doi.org/10.1016/j.jclinepi.2019.08.003
IF: 7.407
2019-01-01
Journal of Clinical Epidemiology
Abstract:We read with interest “A systematic review shows no performance benefit of machine learning over logistic regression for clinical prediction models” by Christodoulou et al. [ [1] Christodoulou E. Ma J. Collins G.S. Steyerberg E.W. Verbakel J.Y. Van Calster B. A systematic review shows no performance benefit of machine learning over logistic regression for clinical prediction models. J Clin Epidemiol. 2019; 110: 12-22 Abstract Full Text Full Text PDF PubMed Scopus (604) Google Scholar ] appreciating its rigor and importance. We question, however, the conceptual dichotomy of logistic regression (LR) vs. machine learning (ML). Statistics versus machine learning: definitions are interesting (but understanding, methodology, and reporting are more important)Journal of Clinical EpidemiologyVol. 116PreviewWe thank Bian et al for their interest in our study. Since the study was published, the distinction between logistic regression and machine learning has fueled a lot of discussion. We had addressed this issue already in the initial submission but corroborated on it based on the reviewers’ comments. We state in the article that we do not believe there is a clear dichotomy but rather that algorithms lie on a continuum regarding flexibility, and reliance on the data versus subject knowledge. Nevertheless, several publications and discussions explicitly make this distinction and often conclude that machine learning leads to better predictive performance compared with traditional statistical methods. Full-Text PDF
What problem does this paper attempt to address?