Applications and Performance of Machine Learning Algorithms in Emergency Medical Services: A Scoping Review

Ahmad Alrawashdeh,Saeed Alqahtani,Zaid I. Alkhatib,Khalid Kheirallah,Nebras Y. Melhem,Mahmoud Alwidyan,Arwa M. Al-Dekah,Talal Alshammari,Ziad Nehme
DOI: https://doi.org/10.1017/s1049023x24000414
2024-05-18
Prehospital and Disaster Medicine
Abstract:Objective: The aim of this study was to summarize the literature on the applications of machine learning (ML) and their performance in Emergency Medical Services (EMS). Methods: Four relevant electronic databases were searched (from inception through January 2024) for all original studies that employed EMS-guided ML algorithms to enhance the clinical and operational performance of EMS. Two reviewers screened the retrieved studies and extracted relevant data from the included studies. The characteristics of included studies, employed ML algorithms, and their performance were quantitively described across primary domains and subdomains. Results: This review included a total of 164 studies published from 2005 through 2024. Of those, 125 were clinical domain focused and 39 were operational. The characteristics of ML algorithms such as sample size, number and type of input features, and performance varied between and within domains and subdomains of applications. Clinical applications of ML algorithms involved triage or diagnosis classification (n = 62), treatment prediction (n = 12), or clinical outcome prediction (n = 50), mainly for out-of-hospital cardiac arrest/OHCA (n = 62), cardiovascular diseases/CVDs (n = 19), and trauma (n = 24). The performance of these ML algorithms varied, with a median area under the receiver operating characteristic curve (AUC) of 85.6%, accuracy of 88.1%, sensitivity of 86.05%, and specificity of 86.5%. Within the operational studies, the operational task of most ML algorithms was ambulance allocation (n = 21), followed by ambulance detection (n = 5), ambulance deployment (n = 5), route optimization (n = 5), and quality assurance (n = 3). The performance of all operational ML algorithms varied and had a median AUC of 96.1%, accuracy of 90.0%, sensitivity of 94.4%, and specificity of 87.7%. Generally, neural network and ensemble algorithms, to some degree, out-performed other ML algorithms. Conclusion: Triaging and managing different prehospital medical conditions and augmenting ambulance performance can be improved by ML algorithms. Future reports should focus on a specific clinical condition or operational task to improve the precision of the performance metrics of ML models.
emergency medicine
What problem does this paper attempt to address?
The main objective of this paper is to summarize the literature on the application of Machine Learning (ML) algorithms in Emergency Medical Services (EMS) and their performance. Researchers collected all original studies that utilized ML algorithms under the guidance of EMS to enhance clinical and operational performance by searching four relevant electronic databases from their inception until January 2024. After screening, a total of 164 studies were included, with 125 focused on the clinical domain and 39 on the operational domain. In terms of clinical applications, ML algorithms were primarily used for classification or diagnosis (such as out-of-hospital cardiac arrest, cardiovascular diseases, and trauma), predicting treatment outcomes, and forecasting clinical endpoints. The performance of these algorithms varied, with the Area Under the Curve (AUC), accuracy, sensitivity, and specificity as evaluation metrics. The median AUC was 85.6%, accuracy was 88.1%, sensitivity was 86.05%, and specificity was 86.5%. In operational research, ML algorithms were mainly applied to tasks such as ambulance dispatch, detection, deployment, route optimization, and quality assurance, with varying performance as well. The median AUC was 96.1%, accuracy was 90.0%, sensitivity was 94.4%, and specificity was 87.7%. Neural networks and ensemble algorithms were somewhat superior to other algorithms. In summary, ML algorithms have shown potential for improvement in the classification, management, and enhancement of ambulance performance in pre-hospital medical conditions. Future research should focus on specific clinical conditions or operational tasks to improve the precision of performance metrics for ML models.