Predicting the Stay Length of Patients in Hospitals using Convolutional Gated Recurrent Deep Learning Model

Mehdi Neshat,Michael Phipps,Chris A. Browne,Nicole T. Vargas,Seyedali Mirjalili
2024-09-26
Abstract:Predicting hospital length of stay (LoS) stands as a critical factor in shaping public health strategies. This data serves as a cornerstone for governments to discern trends, patterns, and avenues for enhancing healthcare delivery. In this study, we introduce a robust hybrid deep learning model, a combination of Multi-layer Convolutional (CNNs) deep learning, Gated Recurrent Units (GRU), and Dense neural networks, that outperforms 11 conventional and state-of-the-art Machine Learning (ML) and Deep Learning (DL) methodologies in accurately forecasting inpatient hospital stay duration. Our investigation delves into the implementation of this hybrid model, scrutinising variables like geographic indicators tied to caregiving institutions, demographic markers encompassing patient ethnicity, race, and age, as well as medical attributes such as the CCS diagnosis code, APR DRG code, illness severity metrics, and hospital stay duration. Statistical evaluations reveal the pinnacle LoS accuracy achieved by our proposed model (CNN-GRU-DNN), which averages at 89% across a 10-fold cross-validation test, surpassing LSTM, BiLSTM, GRU, and Convolutional Neural Networks (CNNs) by 19%, 18.2%, 18.6%, and 7%, respectively. Accurate LoS predictions not only empower hospitals to optimise resource allocation and curb expenses associated with prolonged stays but also pave the way for novel strategies in hospital stay management. This avenue holds promise for catalysing advancements in healthcare research and innovation, inspiring a new era of precision-driven healthcare practices.
Neural and Evolutionary Computing,Machine Learning
What problem does this paper attempt to address?
The problem that this paper attempts to solve is to predict the length of stay (LoS) of hospital patients. Accurate prediction of LoS is crucial for optimizing hospital resource allocation, reducing costs and improving the efficiency of medical services. However, due to the diversity of patient characteristics, treatment complexity and social factors, the prediction of LoS is highly uncertain, and the LoS of different patients varies greatly (ranging from 2 days to more than 50 days), which makes it very challenging to develop a high - precision prediction model. To solve this problem, the author proposes a hybrid deep - learning model, which combines multi - layer convolutional neural networks (CNN), gated recurrent units (GRU) and dense neural networks (DNN). This model aims to handle the non - linear relationships of patient flow in the hospital environment and improve the accuracy of LoS prediction by integrating spatial and temporal information. ### Specific problem description 1. **Importance of LoS prediction**: - Accurate prediction of LoS helps hospitals optimize resource allocation and reduce additional costs caused by long - term hospitalization. - Improve the quality of patient care and ensure the effective use of medical resources. - Support the formulation of public health strategies and help the government identify trends and improve medical services. 2. **Limitations of existing methods**: - Traditional classification and regression models have limitations in dealing with LoS prediction, especially in the face of skewed (right - skewed) data distribution and unbalanced data sets. - A single machine - learning or deep - learning model is difficult to capture complex spatio - temporal dependencies. 3. **Proposed solution**: - Build a hybrid deep - learning model that combines the advantages of CNN, GRU and DNN to meet the above challenges. - Use large - scale data sets (such as the New York State SPARCS data set) for training and validation to ensure the generalization ability of the model. - Evaluate the performance of the model through 10 - fold cross - validation and compare it with 11 traditional and advanced machine - learning and deep - learning models. ### Main components of the model - **CNN (Convolutional Neural Network)**: Used to extract spatial features in the input data. - **GRU (Gated Recurrent Unit)**: Processes time - series data and captures time - dependent relationships. - **DNN (Dense Neural Network)**: Used for the final regression prediction, integrating the features extracted by the previous two parts. Through this hybrid architecture, the model can perform excellently in dealing with complex LoS prediction tasks, with an average accuracy rate of 89%, significantly better than other methods. ### Summary This paper aims to solve the existing challenges in current LoS prediction by introducing a new hybrid deep - learning model, thereby providing more accurate support for hospital management and medical services.