PHE-SICH-CT-IDS: A Benchmark CT Image Dataset for Evaluation Semantic Segmentation, Object Detection and Radiomic Feature Extraction of Perihematomal Edema in Spontaneous Intracerebral Hemorrhage

Deguo Ma,Chen Li,Lin Qiao,Tianming Du,Dechao Tang,Zhiyu Ma,Marcin Grzegorzek Hongzan,Hongzan Sun
2023-08-21
Abstract:Intracerebral hemorrhage is one of the diseases with the highest mortality and poorest prognosis worldwide. Spontaneous intracerebral hemorrhage (SICH) typically presents acutely, prompt and expedited radiological examination is crucial for diagnosis, localization, and quantification of the hemorrhage. Early detection and accurate segmentation of perihematomal edema (PHE) play a critical role in guiding appropriate clinical intervention and enhancing patient prognosis. However, the progress and assessment of computer-aided diagnostic methods for PHE segmentation and detection face challenges due to the scarcity of publicly accessible brain CT image datasets. This study establishes a publicly available CT dataset named PHE-SICH-CT-IDS for perihematomal edema in spontaneous intracerebral hemorrhage. The dataset comprises 120 brain CT scans and 7,022 CT images, along with corresponding medical information of the patients. To demonstrate its effectiveness, classical algorithms for semantic segmentation, object detection, and radiomic feature extraction are evaluated. The experimental results confirm the suitability of PHE-SICH-CT-IDS for assessing the performance of segmentation, detection and radiomic feature extraction methods. To the best of our knowledge, this is the first publicly available dataset for PHE in SICH, comprising various data formats suitable for applications across diverse medical scenarios. We believe that PHE-SICH-CT-IDS will allure researchers to explore novel algorithms, providing valuable support for clinicians and patients in the clinical setting. PHE-SICH-CT-IDS is freely published for non-commercial purpose at: <a class="link-external link-https" href="https://figshare.com/articles/dataset/PHE-SICH-CT-IDS/23957937" rel="external noopener nofollow">this https URL</a>.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The paper aims to address the issues of segmentation, detection, and radiomics feature extraction of perihematomal edema (PHE) in spontaneous intracerebral hemorrhage (SICH). Specifically: 1. **Dataset Construction**: The paper establishes a publicly available CT dataset, PHE-SICH-CT-IDS, for evaluating the performance of semantic segmentation, object detection, and radiomics feature extraction algorithms in SICH. This dataset includes brain CT scan images of 120 patients and their corresponding medical information. 2. **Technical Validation**: The dataset is evaluated for segmentation tasks using various classical machine learning methods (such as k-means, MRF, Otsu, and watershed algorithms) and deep learning methods (such as U-Net, SegNet, SwinUNet, and TransUNet). Additionally, Faster R-CNN, SSD, and YOLO series algorithms are used for object detection. 3. **Radiomics Feature Analysis**: A total of 1316 texture features are automatically extracted from the segmented PHE regions, including first-order statistical features, shape features, gray-level co-occurrence matrix (GLCM) features, etc. Features with ICC>0.80 are selected through consistency testing for subsequent modeling. 4. **Clinical Application Value**: The researchers hope that this dataset can promote the development of new algorithms and provide valuable diagnostic tools for clinicians, thereby improving patient outcomes. In summary, this paper is dedicated to advancing computer-aided diagnostic technology for SICH-related diseases by constructing a high-quality dataset, providing important benchmark resources for PHE segmentation, detection, and radiomics feature extraction.