Abstract:Tenggao Chen, 1, &ast Wanlin Lei, 2, &ast Maofeng Wang 2 1 Department of Colorectal Surgery, Affiliated Dongyang Hospital, Wenzhou Medical University, Dongyang, Zhejiang, 322100, People's Republic of China; 2 Department of Biomedical Sciences Laboratory, Affiliated Dongyang Hospital, Wenzhou Medical University, Dongyang, Zhejiang, 322100, People's Republic of China &astThese authors contributed equally to this work Correspondence: Maofeng Wang, Department of Biomedical Sciences Laboratory, No. 60 Wuning West Road, Affiliated Dongyang Hospital, Wenzhou Medical University, Dongyang, 322100, People's Republic of China, Email Objective: This study aimed to develop a predictive model for assessing internal bleeding risk in elderly aspirin users using machine learning. Methods: A total of 26,030 elderly aspirin users (aged over 65) were retrospective included in the study. Data on patient demographics, clinical features, underlying diseases, medical history, and laboratory examinations were collected from Affiliated Dongyang Hospital of Wenzhou Medical University. Patients were randomly divided into two groups, with a 7:3 ratio, for model development and internal validation, respectively. Least absolute shrinkage and selection operator (LASSO) regression, extreme gradient boosting (XGBoost), and multivariate logistic regression were employed to develop prediction models. Model performance was evaluated using area under the curve (AUC), calibration curves, decision curve analysis (DCA), clinical impact curve (CIC), and net reduction curve (NRC). Results: The XGBoost model exhibited the highest AUC among all models. It consisted of six clinical variables: HGB, PLT, previous bleeding, gastric ulcer, cerebral infarction, and tumor. A visual nomogram was developed based on these six variables. In the training dataset, the model achieved an AUC of 0.842 (95% CI: 0.829– 0.855), while in the test dataset, it achieved an AUC of 0.820 (95% CI: 0.800– 0.840), demonstrating good discriminatory performance. The calibration curve analysis revealed that the nomogram model closely approximated the ideal curve. Additionally, the DCA curve, CIC, and NRC demonstrated favorable clinical net benefit for the nomogram model. Conclusion: This study successfully developed a predictive model to estimate the risk of bleeding in elderly aspirin users. This model can serve as a potential useful tool for clinicians to estimate the risk of bleeding in elderly aspirin users and make informed decisions regarding their treatment and management. Keywords: aspirin, bleeding, haemorrhage, predictive model, extreme gradient boosting, nomogram Aspirin is extensively utilized in the management and prevention of various diseases, particularly coronary artery disease. 1 However, the use of aspirin in elderly patients poses a challenge due to an elevated risk of bleeding. 2 Balancing the prevention of cardiovascular events with minimizing bleeding risks is a major concern. 3 Elderly individuals are prone to aspirin-induced gastric injury, 4 Recent evidence suggests that daily aspirin use does not improve survival in healthy elderly individuals (> 70 years old). Conversely, the aspirin group had a higher incidence of major hemorrhage compared to control group. 5 Several bleeding risk scores have been developed to assist in selecting appropriate treatment regimens and durations, providing valuable insights for clinical practice. 6 The PRECISE-DAPT risk score accurately predicts bleeding risk in aspirin users and has been recommended (Class IIB) for identifying high-risk patients susceptible to bleeding. 7 The bleeding score effectively stratifies bleeding and ischemic risk across diverse study populations, consistently providing benefit-risk difference stratification. 8 European guidelines emphasize a personalized approach to balancing bleeding and ischemic risks instead of a generalized strategy for aspirin use. 9 Although American and European guidelines primarily recommend PARIS and PRECISE scores, they have limitations due to variations in patient cohorts. 10,11 New clinical models have recently emerged to improve hemorrhagic event prediction, incorporating commonly used scoring systems such as CRUSADE, 12 ARC-HBR, 13 ACUITY-HORIZONS, 14 BleeMACS, 15 TIMI risk score, 16 HAS-Bled score, 17 GRACE score, 12 and CHA2DS2-VASC score. 18 These scores evaluate various clinical characteristics including coronary anatomy, surgical procedures, genotyping, lifestyle factors, and treatment adherence. 19</su -Abstract Truncated-

Developing a machine learning model for bleeding prediction in patients with cancer-associated thrombosis receiving anticoagulation therapy

Machine Learning for Prediction of Cancer-Associated Venous Thromboembolism

Development and Validation of a Practical Model to Identify Patients at Risk of Bleeding after TAVR

Development and validation of machine learning models to predict the need for haemostatic therapy in acute upper gastrointestinal bleeding

Machine learning derived model for the prediction of bleeding in dual antiplatelet therapy patients

Impact of applying machine learning to the electronic medical record on prediction of cancer-associated thrombosis.

Machine learning analysis of bleeding status in venous thromboembolism patients

Development and Validation of Machine Learning Algorithms to Predict 1-Year Ischemic Stroke and Bleeding Events in Patients with Atrial Fibrillation and Cancer

Machine Learning - Based Bleeding Risk Predictions in Atrial Fibrillation Patients on Direct Oral Anticoagulants

A deep-learning approach to predict bleeding risk over time in patients on extended anticoagulation therapy

Using machine learning to predict venous thromboembolism and major bleeding events following total joint arthroplasty

Using machine learning to predict the bleeding risk for patients with cardiac valve replacement treated with warfarin in hospitalized

Development and Validation of an ICU-Venous Thromboembolism Prediction Model Using Machine Learning Approaches: A Multicenter Study

A novel dynamic risk score to predict Clinically Significant Bleeding (CSB) after 3 months of anticoagulation in patients with incident Venous ThromboEmbolism (VTE) and without active cancer

Machine learning approach for prediction of outcomes in anticoagulated patients with atrial fibrillation

Machine Learning as a Diagnostic and Prognostic Tool for Predicting Thrombosis in Cancer Patients: A Systematic Review

In Search of the Appropriate Anticoagulant-Associated Bleeding Risk Assessment Model for Cancer-Associated Thrombosis Patients

Harnessing risk assessment for thrombosis and bleeding to optimize anticoagulation strategy in nonvalvular atrial fibrillation

Machine learning-based prediction of the post-thrombotic syndrome: Model development and validation study

Machine Learning Predicts Cancer-Associated Deep Vein Thrombosis Using Clinically Available Variables

Predictive Model of Internal Bleeding in Elderly Aspirin Users Using XGBoost Machine Learning