Development and validation of machine learning models for predicting cancer-related fatigue in lymphoma survivors

Yiming Wang, Lv Tian, Wenqiu Wang, Weiping Pang, Yue Song, Xiaofang Xu, Fengzhi Sun, Wenbo Nie, Xia Zhao, Lisheng Wang
DOI: https://doi.org/10.1016/j.ijmedinf.2024.105630
Abstract:
Background: New cases of lymphoma are rising, and symptoms such as cancer-related fatigue (CRF) severely impair the quality of life of lymphoma survivors. However, clinical diagnosis and treatment of CRF remain inadequate.
Objective: This study aims to construct machine learning-based CRF prediction models for lymphoma survivors, to help healthcare professionals accurately identify patients with CRF and better personalize their treatment and care.
Methods: A cross-sectional study in China recruited lymphoma patients from June 2023 to March 2024 and divided them into two datasets, one for model construction and one for external validation. Six machine learning algorithms were used: Logistic Regression (LR), Random Forest, Single Hidden Layer Neural Network, Support Vector Machine, eXtreme Gradient Boosting, and Light Gradient Boosting Machine (LightGBM). Performance was compared using metrics such as the area under the receiver operating characteristic curve (AUROC) and calibration curves. Clinical applicability was assessed by decision curve analysis, and Shapley additive explanations (SHAP) were used to explain variable importance.
Results: CRF incidence was 40.7% in dataset I and 44.8% in dataset II. LightGBM showed strong performance in training and internal validation. LR performed best in external validation, with the highest AUROC and the best calibration. Pain, total protein, physical function, and sleep disturbance were important predictors of CRF.
Conclusion: The study presents a machine learning-based CRF prediction model for lymphoma patients, offering dynamic, data-driven assessments that could support the development of automated CRF screening tools for personalized management in clinical practice.
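To make the comparison metrics concrete, the following is a minimal sketch of how AUROC (discrimination) and a simple calibration summary could be computed for two competing models. It is not the authors' pipeline: the function names `auroc` and `brier_score`, the toy labels, and the predicted probabilities are all hypothetical, and the Brier score is used here only as an assumed stand-in for the calibration curves mentioned in the abstract.

```python
def auroc(y_true, y_score):
    """AUROC via the Mann-Whitney statistic: the probability that a randomly
    chosen positive case is scored above a randomly chosen negative case,
    with ties counted as one half."""
    pos = [s for y, s in zip(y_true, y_score) if y == 1]
    neg = [s for y, s in zip(y_true, y_score) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative label")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def brier_score(y_true, y_prob):
    """Mean squared difference between predicted probability and outcome.
    Lower is better; 0.0 means perfect probabilistic predictions."""
    return sum((p - y) ** 2 for y, p in zip(y_true, y_prob)) / len(y_true)


# Hypothetical toy data: 1 = CRF present, 0 = CRF absent.
labels = [1, 0, 1, 1, 0, 0, 1, 0]
# Hypothetical predicted probabilities from two candidate models.
model_a = [0.9, 0.2, 0.7, 0.6, 0.4, 0.1, 0.8, 0.3]
model_b = [0.6, 0.5, 0.4, 0.7, 0.6, 0.2, 0.9, 0.3]

for name, probs in [("model_a", model_a), ("model_b", model_b)]:
    print(name, "AUROC:", auroc(labels, probs), "Brier:", brier_score(labels, probs))
```

In this toy example `model_a` ranks every positive case above every negative case, so its AUROC is 1.0, while `model_b` misorders some pairs and scores lower. A model can rank well yet be poorly calibrated, which is why the study reports both kinds of metric.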