Development and evaluation of a risk predictive model for high level of self-reported symptom cluster dis-tress in breast cancer patients undergoing chemotherapy

WU Fulei,YUAN Changrong,YANG Yang,SHANG Meimei,YAN Rong,ZHENG Yeping,NIU Meie,HUANG Qingmei
DOI: https://doi.org/10.3761/j.issn.0254-1769.2023.06.005
2023-01-01
Abstract:Objective To develop and evaluate a risk predictive model for high level of self-reported symptom cluster distress in breast cancer patients undergoing chemotherapy. Methods 647 patients who received chemotherapy and met the inclusion and exclusion criteria were selected at 4 tertiary hospitals in Shanghai,Shandong,Jiangsu,and Zhejiang from October 2019 to May 2021. Cases were randomly assigned to a modeling group and a validation group based on 5-fold cross-validation method at a ratio of 8:2. The random forest algorithm was used to develop the risk predictive model in the modeling group. The receiver operating characteristic curve,Hosmer-Lemeshow goodness-of-fit test,calibration curve,and decision curve were used to comprehensively evaluate the prediction performance of the model in the validation group. Risk factors were identified based on the order of the importance of each influencing factors. Results The incidence of high symptom cluster distress was 33.27% in the modeling group and 29.23% in the validation group. The area under the receiver operating characteristic curve of the prediction model was 0.91;the sensitivity was 65.8%;the specificity was 93.5%;Hosmer-Lemeshow goodness-of-fit test was insignificant(P =0.1365);the decision curve was above the reference line. Body image,self-efficacy,and financial burden were the most important predictors.Conclusion The risk predictive model based on random forest algorithm has good predictive performance,which is of great significance to help identify breast cancer subgroups at high risk of symptom cluster distress,and will potentially promote symptom management.
What problem does this paper attempt to address?