Hospital readmission prediction with hybrid‐sampling and self‐paced balance learning

Ying Xu,Ning Zhang,Ao Wang,Tao Feng,Guodong Du
DOI: https://doi.org/10.1002/cpe.8155
2024-05-28
Concurrency and Computation Practice and Experience
Abstract:Summary Hospital readmission prediction is defined as an evaluation task to model the historical medical data to predict whether patients will be readmitted after discharge. In the past few years, many feasible and effective prediction methods have been proposed, however, most of them neglect the imbalanced distribution of medical data, which causes great difficulties in modeling. Thus, we proposed a new hospital readmission prediction method, which utilizes hybrid‐sampling and self‐paced balance learning strategies to solve the class‐imbalance problem. To be specifically, we first employ an interference negative sample deletion strategy to reduce the probability of important majority class samples being deleted. Then, we design a hard positive sample generation strategy to generate more positive samples. Meanwhile, we also introduce a self‐paced balance factor during the oversampling process to improve the similarity between newly generated minority class samples and hard positive samples. Finally, we perform the experiments on six real‐world readmission datasets to indicate the superiority of the proposed method.
computer science, theory & methods, software engineering
What problem does this paper attempt to address?