Abstract:Background: Digital just-in-time adaptive interventions can reduce binge-drinking events (BDEs; consuming ≥4 drinks for women and ≥5 drinks for men per occasion) in young adults but need to be optimized for timing and content. Delivering just-in-time support messages in the hours prior to BDEs could improve intervention impact. Objective: We aimed to determine the feasibility of developing a machine learning (ML) model to accurately predict future , that is, same-day BDEs 1 to 6 hours prior BDEs , using smartphone sensor data and to identify the most informative phone sensor features associated with BDEs on weekends and weekdays to determine the key features that explain prediction model performance. Methods: We collected phone sensor data from 75 young adults (aged 21 to 25 years; mean 22.4, SD 1.9 years) with risky drinking behavior who reported their drinking behavior over 14 weeks. The participants in this secondary analysis were enrolled in a clinical trial. We developed ML models testing different algorithms (eg, extreme gradient boosting [XGBoost] and decision tree) to predict same-day BDEs (vs low-risk drinking events and non-drinking periods) using smartphone sensor data (eg, accelerometer and GPS). We tested various "prediction distance" time windows (more proximal: 1 hour; distant: 6 hours) from drinking onset. We also tested various analysis time windows (ie, the amount of data to be analyzed), ranging from 1 to 12 hours prior to drinking onset, because this determines the amount of data that needs to be stored on the phone to compute the model. Explainable artificial intelligence was used to explore interactions among the most informative phone sensor features contributing to the prediction of BDEs. Results: The XGBoost model performed the best in predicting imminent same-day BDEs, with 95% accuracy on weekends and 94.3% accuracy on weekdays ( F 1 -score=0.95 and 0.94, respectively). This XGBoost model needed 12 and 9 hours of phone sensor data at 3- and 6-hour prediction distance from the onset of drinking on weekends and weekdays, respectively, prior to predicting same-day BDEs. The most informative phone sensor features for BDE prediction were time (eg, time of day) and GPS-derived features, such as the radius of gyration (an indicator of travel). Interactions among key features (eg, time of day and GPS-derived features) contributed to the prediction of same-day BDEs. Conclusions: We demonstrated the feasibility and potential use of smartphone sensor data and ML for accurately predicting imminent (same-day) BDEs in young adults. The prediction model provides "windows of opportunity," and with the adoption of explainable artificial intelligence, we identified "key contributing features" to trigger just-in-time adaptive intervention prior to the onset of BDEs, which has the potential to reduce the likelihood of BDEs in young adults. Trial Registration: ClinicalTrials.gov NCT02918565; https://clinicaltrials.gov/ct2/show/NCT02918565

Patterns of high-risk drinking among medical students: A web-based survey with machine learning

Alcohol abuse and dependence among Brazilian medical students: Association to sociodemographic variables, anxiety and depression

Assessment of the Impact of Alcohol Consumption Patterns on Heart Rate Variability by Machine Learning in Healthy Young Adults

Machine‐learning prediction of adolescent alcohol use: a cross‐study, cross‐cultural validation

Alcohol consumption habits and their impact on academic performance: analysis of ethanol patterns among health students. A cross-sectional study

Assessing Alcohol Use Disorder: Insights from Lifestyle, Background, and Family History with Machine Learning Techniques

A machine learning model for the prediction of unhealthy alcohol use among women of childbearing age in Alabama

Craving for a Robust Methodology: A Systematic Review of Machine Learning Algorithms on Substance-Use Disorders Treatment Outcomes

A risk algorithm that predicts alcohol use disorders among college students

Validation of Alcohol Use Disorders Identification Test (AUDIT) in Brazilian Colleges: Network Analysis, Measurement Invariance and Screening Efficiency

Leveraging Social Media Activity and Machine Learning for HIV and Substance Abuse Risk Assessment: Development and Validation Study

Machine learning prediction of blood alcohol concentration: a digital signature of smart-breathalyzer behavior

Potential suicide risk among the college student population: machine learning approaches for identifying predictors and different students' risk profiles

Development and evaluation of a risk algorithm predicting alcohol dependence after early onset of regular alcohol use

Applicability of machine learning algorithm to predict the therapeutic intervention success in Brazilian smokers

Neuropsychosocial Profiles of Current and Future Adolescent Alcohol Misusers

Machine learning algorithms to the early diagnosis of fetal alcohol spectrum disorders

Drug use among medical students in São Paulo, Brazil: a cross-sectional study during the coronavirus disease 2019 pandemic

Multimodal-based machine learning approach to classify features of internet gaming disorder and alcohol use disorder: A sensor-level and source-level resting-state electroencephalography activity and neuropsychological study

Leveraging Mobile Phone Sensors, Machine Learning, and Explainable Artificial Intelligence to Predict Imminent Same-Day Binge-drinking Events to Support Just-in-time Adaptive Interventions: Algorithm Development and Validation Study

Alcohol and Drug Use in European University Health Science Students: Relationship with Self-Care Ability