Abstract: The timely stratification of trauma injury severity can enhance the quality of trauma care, but it requires intense manual annotation from certified trauma coders. There is a need to establish an automated tool to identify the severity of trauma injuries across various body regions. We gather trauma registry data from a Level I Trauma Center at the University of Wisconsin-Madison (UW Health) between 2015 and 2019. Our study utilizes clinical documents and structured electronic health records (EHR) variables linked with the trauma registry data to create two machine learning models with different approaches to representing text. The first one fuses concept unique identifiers (CUIs) extracted from free text with structured EHR variables, while the second one integrates free text with structured EHR variables. Both models perform well in categorizing leg injuries, achieving macro-F1 scores of around 0.8. They also achieve macro-F1 scores exceeding 0.6 in assessing injuries in the areas of the chest and head. Temporal validation is conducted to assess the models' temporal generalizability. We show in our variable importance analysis that the most important features in the model have strong face validity in determining clinically relevant trauma injuries.
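The macro-F1 scores quoted in the abstract are the unweighted mean of the per-class F1 scores, so a rare severity class counts as much as a common one. The sketch below shows that computation in plain Python. The severity labels and the toy predictions are hypothetical and are not taken from the study.

```python
from collections import Counter


def macro_f1(y_true, y_pred):
    """Return the unweighted mean of per-class F1 scores."""
    labels = sorted(set(y_true) | set(y_pred))
    tp, fp, fn = Counter(), Counter(), Counter()
    for truth, pred in zip(y_true, y_pred):
        if truth == pred:
            tp[truth] += 1
        else:
            fp[pred] += 1   # predicted class gains a false positive
            fn[truth] += 1  # true class gains a false negative
    scores = []
    for label in labels:
        # F1 = 2*TP / (2*TP + FP + FN); defined as 0 when the class never appears.
        denom = 2 * tp[label] + fp[label] + fn[label]
        scores.append(2 * tp[label] / denom if denom else 0.0)
    return sum(scores) / len(scores)


# Hypothetical severity labels for one body region (illustration only).
y_true = ["none", "none", "none", "none", "minor", "minor", "severe", "severe"]
y_pred = ["none", "none", "none", "minor", "minor", "severe", "severe", "severe"]

score = macro_f1(y_true, y_pred)
print(round(score, 3))  # 0.719
```

Here the per-class F1 scores are 6/7 for "none", 0.5 for "minor", and 0.8 for "severe", and macro-F1 is their plain average.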

What problem does this paper attempt to address?

The paper aims to stratify the severity of trauma injuries automatically by developing multi-modal, multi-class machine learning models, in order to reduce the need for manual annotation by certified trauma coders and improve the quality of trauma care. Specifically, the research uses clinical documents and structured electronic health record (EHR) variables to build automated tools that predict injury severity across multiple body regions, with the strongest results reported for the legs, chest, and head.

### Background and Motivation

Trauma is the leading cause of death among people under 45 years old, resulting in more than 3.5 million hospitalizations in the United States every year. Trauma registry systems play a crucial role in improving trauma care and its related clinical outcomes because they can clarify injury patterns and identify areas for improvement. However, early assessment of trauma severity requires certified trauma coders to use software tools to analyze EHR data, which is a time-consuming and labor-intensive process. In addition, trauma scores are usually recorded after the patient is discharged, which limits their usefulness during the patient's active treatment. Solutions that stratify trauma scores automatically during the course of care could therefore provide more comprehensive and timely data capture and scale more easily across centers.

### Research Methods

The research team collected trauma registry data from 2015 to 2019 at the Level I Trauma Center at the University of Wisconsin-Madison. They developed two machine learning models, one for each of the following text representations:

1. **CUIs + Structured EHR Model**: Merge the Concept Unique Identifiers (CUIs) extracted from free text with structured EHR variables.
2. **Free-text + Structured EHR Model**: Process free text directly and merge it with structured EHR variables.

### Model Architecture

- **Text Encoding**:
  - **CUIs + Structured EHR Model**: Use a one-dimensional convolutional neural network (1D CNN) to encode CUIs.
  - **Free-text + Structured EHR Model**: Use fine-tuned ClinicalBERT to encode free text.
- **Structured EHR Encoding**: Use a pre-trained fully-connected neural network to encode structured EHR variables.
- **Multi-task Neural Network**: Pre-train a multi-task neural network for a binary classification task and use its shared layer as an encoder for structured EHR data.
- **Fusion and Prediction**: Fuse the text embedding (CUI or free-text, depending on the model) with the structured EHR embedding, then perform multi-class prediction through a multi-layer perceptron (MLP).

### Main Results

- **Performance Evaluation**: Both models perform well in leg trauma stratification, with a macro-F1 score close to 0.8; in the chest, abdomen and spine (chest abdspine) region and the head, face and neck (head faceneck) region, the macro-F1 score also exceeds 0.6.
- **Contribution Analysis**: Structured EHR data contributes most in the arm and extremities (arm ext) region, especially in the CUIs + Structured EHR model.

### Conclusion

This research developed two multi-modal machine learning models that can automatically stratify the severity of trauma injuries in multiple body regions. According to the summary, these models improve the accuracy and efficiency of trauma assessment and provide support for future automated trauma care.
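The fusion-and-prediction step described under Model Architecture can be illustrated with a minimal forward pass. This is a sketch under stated assumptions, not the authors' implementation: the summary does not say how the embeddings are fused, so concatenation is assumed here; the dimensions, the single hidden layer, and the random weights are all hypothetical; and plain Python is used only to keep the example dependency-free, where a real model would use a deep learning framework.

```python
import math
import random


def linear(x, weights, bias):
    """Affine layer: one output per weight row."""
    return [sum(w * v for w, v in zip(row, x)) + b for row, b in zip(weights, bias)]


def relu(x):
    return [max(0.0, v) for v in x]


def softmax(x):
    """Numerically stable softmax over a list of logits."""
    peak = max(x)
    exps = [math.exp(v - peak) for v in x]
    total = sum(exps)
    return [e / total for e in exps]


def init_layer(rng, n_in, n_out):
    """Random untrained weights, for illustration only."""
    weights = [[rng.uniform(-0.1, 0.1) for _ in range(n_in)] for _ in range(n_out)]
    return weights, [0.0] * n_out


def fuse_and_predict(text_emb, ehr_emb, hidden_layer, output_layer):
    """Concatenate both embeddings (assumed fusion), then apply an MLP head."""
    fused = text_emb + ehr_emb  # list concatenation
    hidden = relu(linear(fused, *hidden_layer))
    return softmax(linear(hidden, *output_layer))


# Hypothetical sizes: text embedding, EHR embedding, hidden units, severity classes.
TEXT_DIM, EHR_DIM, HIDDEN_DIM, N_CLASSES = 8, 4, 6, 3

rng = random.Random(0)
text_emb = [rng.gauss(0, 1) for _ in range(TEXT_DIM)]  # stands in for a CNN or ClinicalBERT output
ehr_emb = [rng.gauss(0, 1) for _ in range(EHR_DIM)]    # stands in for the structured EHR encoder output
hidden_layer = init_layer(rng, TEXT_DIM + EHR_DIM, HIDDEN_DIM)
output_layer = init_layer(rng, HIDDEN_DIM, N_CLASSES)

probs = fuse_and_predict(text_emb, ehr_emb, hidden_layer, output_layer)
print(probs)  # one probability per severity class
```

Because the head only sees the concatenated vector, the same code serves either text representation: swapping the CUI encoder for the free-text encoder changes `text_emb` but not the fusion logic.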

Automated stratification of trauma injury severity across multiple body regions using multi-modal, multi-class machine learning models
