Prediction of the Healthcare Resource Utilization Using Multi-Output Regression Models

Liwen Cui,Xiaolei Xie,Zuojun Shen,Rui Lu,Haibo Wang
DOI: https://doi.org/10.1080/24725579.2018.1512537
2018-01-01
IISE Transactions on Healthcare Systems Engineering
Abstract:Abstract With the rapidly increasing healthcare cost and the scarcity of inpatient resources, it is of paramount importance to accurately predict the healthcare resource utilization. Previous research mainly focuses on predicting the healthcare cost using single-output models. However, the intensity of the healthcare resource utilization is reflected by multiple measures. For example, the Diagnosis Related Group (DRG) system adopted in China measures the healthcare resource utilization using both cost and length of stay (LoS), which motivates us to jointly predict these two measures. Compared to constructing several independent single-output models for each task, using multi-output models can provide unified prediction rules, reduce the training time, and improve the generalization by leveraging the correlations across tasks. We utilize four multi-output machine learning models, including the multi-task Lasso, the decision tree, the random forest, and the neural network. We evaluate their performance based on the Electronic Health Record (EHR) dataset with approximately 750,000 records. Based on extensive numerical experiments, we provide a guideline for model selection and construction. This research has the potential to improve the management of healthcare resources and provide decision support for healthcare payment system.
What problem does this paper attempt to address?