Comparing 'fair' machine learning models for detecting at-risk online gamblers
W. Spencer Murch Sylvia Kairouz Martin French Department of Sociology and Anthropology,Concordia University,Montreal,Quebec,CanadaSpencer Murch is a cognitive psychologist and CIHR Postdoctoral Fellow at Concordia University. His research examines the behavioural profile of addictive digital product use with the aim of developing more-effective harm prevention tools. He has published 16 journal articles and won the NCPG Annual Research and Outstanding Dissertation Awards.Sylvia Kairouz is a full professor at the Department of Sociology and Anthropology at Concordia University. She holds the FQR-SC Research Chair on Gambling,is the director of the FQR-SC HERMES research team,and is the head of the Lifestyle and Addiction Research Lab at Concordia University.Martin French is an Associate Professor with the Department of Sociology & Anthropology at Concordia University. His research examines the social dimensions of technology with an empirical focus on communications & information technology (CIT),the 'gamblification' of games,and the incorporation of addictive,gambling-like retention mechanics into digital games.
DOI: https://doi.org/10.1080/14459795.2024.2412051
2024-10-19
International Gambling Studies
Abstract:Researchers have worked to develop machine learning models that detect at-risk online gamblers, enabling personalized harm prevention tools. However, existing research has not evaluated these models' potential to reinforce or amplify sociodemographic biases leading to treatment disparity, a recognized issue in the machine learning field. We sought to develop and compare three examples of potentially fair models using online gambling data. In two large samples of transaction data from a provincially owned Canadian gambling website (N 1 = 9,145, N 2 = 10,716), we developed three machine learning models based on competing concepts of fairness: fairness via unawareness , classification parity , and outcome calibration . We hypothesized that significant relationships existed between reporting a high risk of past-year gambling problems (the dependent variable) and participants' age and sex. Further, we hypothesized that the three 'fair' models would show differing levels of classification performance both in aggregate and within sociodemographic groups. Significant age and sex effects were found, refuting the fairness via unawareness modeling strategy. Superiority across all performance metrics was not present for either of the remaining models. For the fairest practices in any jurisdiction, classification parity and outcome calibration models should be tested in situ , and incorporate the perspectives and preferences of end users who will be affected.
substance abuse