Abstract:Background:Medication nonadherence represents a major burden on national health systems. According to the World Health Organization, increasing medication adherence may have a greater impact on public health than any improvement in specific medical treatments. More research is needed to better predict populations at risk of medication nonadherence. Objective:To develop clinically informative, easy-to-interpret machine learning classifiers to predict people with psychiatric disorders at risk of medication nonadherence based on the syntactic and structural features of written posts on health forums. Methods:All data were collected from posts between 2016 and 2021 on mental health forum, administered by Together 4 Change, a long-running not-for-profit organisation based in Oxford, UK. The original social media data were annotated using the Tool for the Automatic Analysis of Syntactic Sophistication and Complexity (TAASSC) system. Through applying multiple feature optimisation techniques, we developed a best-performing model using relevance vector machine (RVM) for the probabilistic prediction of medication nonadherence among online mental health forum discussants. Results:The best-performing RVM model reached a mean AUC of 0.762, accuracy of 0.763, sensitivity of 0.779, and specificity of 0.742 on the testing dataset. It outperformed competing classifiers with more complex feature sets with statistically significant improvement in sensitivity and specificity, after adjusting the alpha levels with Benjamini-Hochberg correction procedure. Discussion. We used the forest plot of multiple logistic regression to explore the association between written post features in the best-performing RVM model and the binary outcome of medication adherence among online post contributors with psychiatric disorders. We found that increased quantities of 3 syntactic complexity features were negatively associated with psychiatric medication adherence: "dobj_stdev" (standard deviation of dependents per direct object of nonpronouns) (OR, 1.486, 95% CI, 1.202-1.838, P < 0.001), "cl_av_deps" (dependents per clause) (OR, 1.597, 95% CI, 1.202-2.122, P, 0.001), and "VP_T" (verb phrases per T-unit) (OR, 2.23, 95% CI, 1.211-4.104, P, 0.010). Finally, we illustrated the clinical use of the classifier with Bayes' monograph which gives the posterior odds and their 95% CI of positive (nonadherence) versus negative (adherence) cases as predicted by the best-performing classifier. The odds ratio of the posterior probability of positive cases was 3.9, which means that around 10 in every 13 psychiatric patients with a positive result as predicted by our model were following their medication regime. The odds ratio of the posterior probability of true negative cases was 0.4, meaning that around 10 in every 14 psychiatric patients with a negative test result after screening by our classifier were not adhering to their medications. Conclusion:Psychiatric medication nonadherence is a large and increasing burden on national health systems. Using Bayesian machine learning techniques and publicly accessible online health forum data, our study illustrates the viability of developing cost-effective, informative decision aids to support the monitoring and prediction of patients at risk of medication nonadherence.

Machine Learning (ML) Application to Predict Patient Risk of Nonadherence in Type 2 Diabetes Management Using US Claims Databases

Development and assessment of novel machine learning models to predict medication non-adherence risks in type 2 diabetics

Predictive models of medication non-adherence risks of patients with T2D based on multiple machine learning algorithms

Machine learning-based prediction of medication refill adherence among first-time insulin users with type 2 diabetes

Machine Learning Approaches to Predict Risks of Diabetic Complications and Poor Glycemic Control in Nonadherent Type 2 Diabetes.

Predicting population: development and validation of a new predictive nomogram for evaluating medication nonadherence risk in a type 2 diabetes

1233-P: Prediction of Type 2 Diabetes Occurrence Using Machine Learning Model

Predictive Modeling of Factors Influencing Adherence to SGLT-2 Inhibitors in Ambulatory Care: Insights from Prescription Claims Data Analysis

Proactive Identification of Patients with Diabetes at Risk of Uncontrolled Outcomes during a Diabetes Management Program: Conceptualization and Development Study Using Machine Learning

Peer Review #1 of "predicting Population: Development and Validation of a New Predictive Nomogram for Evaluating Medication Nonadherence Risk in a Type 2 Diabetes (V0.1)"

Machine Learning Approaches for Type 2 Diabetes Prediction and Care Management

Machine Learning to Identify Predictors of Glycemic Control in Type 2 Diabetes: An Analysis of Target HbA1c Reduction Using Empagliflozin/Linagliptin Data

1290-P: Developing Machine Learning Model for Predicting Acute Coronary Syndrome in Type 2 Diabetes Mellitus Patients through Substitution of Propensity Scores for Binary Variables

Machine learning based predictive model of Type 2 diabetes complications using Malaysian National Diabetes Registry: A study protocol

Machine Learning-Based Prediction for High Health Care Utilizers by Using a Multi-Institutional Diabetes Registry: Model Training and Evaluation

Prediction of Type 2 Diabetes Based on Machine Learning Algorithm

Using Machine Learning Techniques to Develop Risk Prediction Models for the Risk of Incident Diabetic Retinopathy Among Patients With Type 2 Diabetes Mellitus: A Cohort Study

Exploring metformin monotherapy response in Type-2 diabetes: Computational insights through clinical, genomic, and proteomic markers using machine learning algorithms

Applying Machine Learning Models to Predict Medication Nonadherence in Crohn's Disease Maintenance Therapy.

1132-P: Classification and Prediction of Diabetes Using Electronic Health Records and Wearable Devices for Clinical Decision Support

Probabilistic Prediction of Nonadherence to Psychiatric Disorder Medication from Mental Health Forum Data: Developing and Validating Bayesian Machine Learning Classifiers.