A novel open-source ultrasound dataset with deep learning benchmarks for spinal cord injury localization and anatomical segmentation

Avisha Kumar,Kunal Kotkar,Kelly Jiang,Meghana Bhimreddy,Daniel Davidar,Carly Weber-Levine,Siddharth Krishnan,Max J. Kerensky,Ruixing Liang,Kelley Kempski Leadingham,Denis Routkevitch,Andrew M. Hersh,Kimberly Ashayeri,Betty Tyler,Ian Suk,Jennifer Son,Nicholas Theodore,Nitish Thakor,Amir Manbachi
2024-09-25
Abstract:While deep learning has catalyzed breakthroughs across numerous domains, its broader adoption in clinical settings is inhibited by the costly and time-intensive nature of data acquisition and annotation. To further facilitate medical machine learning, we present an ultrasound dataset of 10,223 Brightness-mode (B-mode) images consisting of sagittal slices of porcine spinal cords (N=25) before and after a contusion injury. We additionally benchmark the performance metrics of several state-of-the-art object detection algorithms to localize the site of injury and semantic segmentation models to label the anatomy for comparison and creation of task-specific architectures. Finally, we evaluate the zero-shot generalization capabilities of the segmentation models on human ultrasound spinal cord images to determine whether training on our porcine dataset is sufficient for accurately interpreting human data. Our results show that the YOLOv8 detection model outperforms all evaluated models for injury localization, achieving a mean Average Precision (mAP50-95) score of 0.606. Segmentation metrics indicate that the DeepLabv3 segmentation model achieves the highest accuracy on unseen porcine anatomy, with a Mean Dice score of 0.587, while SAMed achieves the highest Mean Dice score generalizing to human anatomy (0.445). To the best of our knowledge, this is the largest annotated dataset of spinal cord ultrasound images made publicly available to researchers and medical professionals, as well as the first public report of object detection and segmentation architectures to assess anatomical markers in the spinal cord for methodology development and clinical applications.
Image and Video Processing,Computer Vision and Pattern Recognition,Machine Learning
What problem does this paper attempt to address?
The paper attempts to address two main issues: 1. **Lack of datasets**: Currently, in the field of spinal cord injury (SCI) ultrasound imaging, there is a lack of large-scale, high-quality, and publicly available datasets. This limits the application of deep learning methods in clinical environments due to the high cost and time-consuming nature of data acquisition and annotation. 2. **Challenges in automatic detection and segmentation**: Existing deep learning models need further improvement in the performance of spinal cord injury localization and anatomical structure segmentation. Particularly in the real-time monitoring of spinal cord parameters (such as hematoma formation and tissue inflammation), current methods are insufficient and cannot achieve efficient and accurate automated diagnosis. To address these issues, the study provides the following solutions: - **Building a large-scale open-source dataset**: The research team created a dataset containing 10,223 B-mode ultrasound images obtained from the spinal cords of 25 pigs (both before and after injury). This dataset includes images of both healthy and injured spinal cords, covering various anatomical structures (such as dura mater, cerebrospinal fluid, arachnoid, spinal cord, etc.). - **Benchmarking deep learning models**: The research team benchmarked several state-of-the-art object detection algorithms to evaluate their performance in localizing spinal cord injury sites (i.e., hematomas). They also benchmarked semantic segmentation models to assess their performance in labeling anatomical structures. - **Zero-shot generalization capability assessment**: The research team also evaluated the zero-shot generalization capability of these segmentation models on human spinal cord ultrasound images to determine whether models trained on the pig dataset can accurately interpret human data. Through these efforts, the research team hopes to advance medical computer vision, improve the automatic monitoring and diagnosis of spinal cord injuries, and ultimately enhance clinical outcomes.