YOLOv9 for Fracture Detection in Pediatric Wrist Trauma X-ray Images

Chun-Tse Chien, Rui-Yang Ju, Kuang-Yi Chou, Jen-Shiun Chiang
2024-05-27
Abstract: The introduction of YOLOv9, the latest version of the You Only Look Once (YOLO) series, has led to its widespread adoption across various scenarios. This paper is the first to apply the YOLOv9 algorithm model to the fracture detection task as computer-assisted diagnosis (CAD) to help radiologists and surgeons interpret X-ray images. Specifically, this paper trained the model on the GRAZPEDWRI-DX dataset and extended the training set using data augmentation techniques to improve the model performance. Experimental results demonstrate that, compared to the current state-of-the-art (SOTA) model, the YOLOv9 model increased the mAP 50-95 from 42.16% to 43.73%, a relative improvement of 3.7%. The implementation code is publicly available at this https URL.
Image and Video Processing, Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The problem this paper addresses is how to use the YOLOv9 algorithm model to accurately detect fractures in X-ray images of pediatric wrist trauma, in order to assist radiologists and surgeons in interpreting those images. Specifically, the paper aims to improve the performance of existing computer-aided diagnosis (CAD) systems on low-feature X-ray images by introducing the YOLOv9 algorithm, thereby increasing the accuracy and efficiency of fracture detection.

### Main contributions of the paper:

1. **First application of YOLOv9**: This is the first time that YOLOv9 has been applied to the fracture detection task, showing that the model performs well not only in real-time object detection but also in medical image recognition.
2. **Reduced information loss**: Applying the YOLOv9 algorithm to low-feature X-ray images mitigates the problem of information loss, so that more useful information is retained and model performance improves.
3. **Improved mAP 50-95**: The mAP 50-95 of the YOLOv9 model trained on the GRAZPEDWRI-DX dataset rose from the previous state-of-the-art (SOTA) value of 42.16% to 43.73%, setting a new SOTA.

### Specific methods for solving the problem:

- **Dataset**: The GRAZPEDWRI-DX dataset was used, which contains 20,327 X-ray images of pediatric wrist trauma.
- **Data augmentation**: The training set was expanded through data augmentation techniques to improve the generalization ability and robustness of the model.
- **YOLOv9 algorithm**: YOLOv9 introduces Programmable Gradient Information (PGI) and the Generalized Efficient Layer Aggregation Network (GELAN) to extract key features more effectively and reduce information loss.
- **Experimental setup**: Training was carried out on an NVIDIA GeForce RTX 3090 GPU with the SGD optimizer, and experiments were run at two input image sizes (640 and 1024).

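
The data augmentation step listed above is not spelled out in this summary. As a minimal sketch only, assuming a photometric augmentation (contrast and brightness adjustment, chosen here purely for illustration) and hypothetical function names, a training set of 8-bit grayscale images could be extended roughly like this:

```python
import numpy as np


def adjust_contrast_brightness(image, contrast=1.2, brightness=0.0):
    """Return an 8-bit copy of `image` scaled by `contrast` and shifted by `brightness`."""
    scaled = image.astype(np.float32) * contrast + brightness
    # Clip to the valid 8-bit range before converting back to uint8.
    return np.clip(scaled, 0, 255).astype(np.uint8)


def extend_training_set(images, contrast=1.2, brightness=0.0):
    """Return the original images followed by one adjusted copy of each."""
    adjusted = [adjust_contrast_brightness(img, contrast, brightness) for img in images]
    return list(images) + adjusted
```

Because this kind of adjustment changes only pixel intensities, the bounding-box annotations of each original image can be reused unchanged for its adjusted copy. The parameter values are placeholders, not settings taken from the paper.
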
With these methods, the paper improves fracture detection accuracy on the GRAZPEDWRI-DX dataset over the previous state of the art, supporting the use of YOLOv9 for medical image analysis.
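
To make the reported metric concrete: mAP 50-95 averages detection precision over ten intersection-over-union (IoU) thresholds from 0.50 to 0.95 in steps of 0.05. The sketch below is not the paper's evaluation code. It only illustrates, with hypothetical helper names, how IoU decides at which thresholds a predicted box counts as a match, and how the 3.7% figure follows from 42.16% and 43.73%.

```python
def iou(box_a, box_b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(box_a[0], box_b[0]), max(box_a[1], box_b[1])
    ix2, iy2 = min(box_a[2], box_b[2]), min(box_a[3], box_b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


# The ten IoU thresholds 0.50, 0.55, ..., 0.95 behind "mAP 50-95".
IOU_THRESHOLDS = [round(0.50 + 0.05 * i, 2) for i in range(10)]


def thresholds_matched(pred_box, gt_box):
    """Thresholds at which the prediction overlaps the ground truth enough to match."""
    overlap = iou(pred_box, gt_box)
    return [t for t in IOU_THRESHOLDS if overlap >= t]


def relative_improvement(baseline, new):
    """Relative change from `baseline` to `new`, in percent."""
    return (new - baseline) / baseline * 100.0


# Absolute gain is 43.73 - 42.16 = 1.57 points; relative gain is about 3.7%.
print(f"{relative_improvement(42.16, 43.73):.1f}%")
```

A full mAP computation would additionally rank predictions by confidence and integrate a precision-recall curve per class at each threshold, which is omitted here.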