Enhancing cervical cancer diagnosis with graph convolution network: AI-powered segmentation, feature analysis, and classification for early detection

Nur Mohammad Fahad,Sami Azam,Sidratul Montaha,Md. Saddam Hossain Mukta
DOI: https://doi.org/10.1007/s11042-024-18608-y
IF: 2.577
2024-02-17
Multimedia Tools and Applications
Abstract:Cervical cancer is a prevalent disease affecting the cervix cells in women and is one of the leading causes of mortality for women globally. The Pap smear test determines the risk of cervical cancer by detecting abnormal cervix cells. Early detection and diagnosis of this cancer can effectively increase the patient's survival rate. The advent of artificial intelligence facilitates the development of automated computer-assisted cervical cancer diagnostic systems, which are widely used to enhance cancer screening. This study emphasizes the segmentation and classification of various cervical cancer cell types. An intuitive but effective segmentation technique is used to segment the nucleus and cytoplasm from histopathological cell images. Additionally, handcrafted features include different properties of the cells generated from the distinct cervical cytoplasm and nucleus area. Two feature rankings techniques are conducted to evaluate this study's significant feature set. Feature analysis identifies the critical pathological properties of cervical cells and then divides them into 30, 40, and 50 sets of diagnostic features. Furthermore, a graph dataset is constructed using the strongest correlated features, prioritizes the relationship between the features, and a robust graph convolution network (GCN) is introduced to efficiently predict the cervical cell types. The proposed model obtains a sublime accuracy of 99.11% for the 40-feature set of the SipakMed dataset. This study outperforms the existing study, performing both segmentation and classification simultaneously, conducting an in-depth feature analysis, attaining maximum accuracy efficiently, and ensuring the interpretability of the proposed model. To validate the model's outcome, we tested it on the Herlev dataset and highlighted its robustness by attaining an accuracy of 98.18%. The results of this proposed methodology demonstrate the dependability of this study effectively, detecting cervical cancer in its early stages and upholding the significance of the lives of women.
computer science, information systems, theory & methods,engineering, electrical & electronic, software engineering
What problem does this paper attempt to address?
The problem that this paper attempts to solve is the early detection and diagnosis of cervical cancer. Specifically, the paper aims to develop an automated computer - aided system for the segmentation, feature analysis and classification of cervical cancer cells by combining image processing, deep learning and Graph Convolution Network (GCN) techniques, in order to improve the accuracy and efficiency of early detection. The paper mentions that cervical cancer is a major threat to women's health, with a large number of new cases and deaths worldwide every year. Early detection and diagnosis can significantly improve the survival rate of patients. Therefore, this paper proposes a new method to enhance the screening and diagnosis ability of cervical cancer by efficiently identifying and analyzing the key features in cervical cell images. ### Main research objectives: 1. **Simultaneous segmentation of multiple nuclei and cytoplasm**: Use a variety of common image pre - processing methods to simultaneously segment multiple nuclei and cytoplasm from histopathological images. 2. **Extract hand - crafted features**: Extract 26 hand - crafted features from the segmented nuclear and cytoplasmic regions. 3. **Extensive feature analysis**: Conduct feature analysis to identify important features and find patterns among these features through feature evaluation. 4. **Construct graph data set**: Construct a graph data set using the strongest correlations among features. Use the Pearson correlation coefficient method to determine the positive correlations among features, and establish edges between feature nodes with a correlation score exceeding 0.5. 5. **Implement Graph Convolution Network (GCN) architecture**: Use GCN to perform five - class classification for classification tasks: dyskeratotic cells, superficial cells, parabasal cells, metaplastic cells and koilocytes. And compare the performance with multiple machine - learning models to verify the robustness of the GCN model. 6. **Validation of effectiveness on different data sets**: Use two different data sets (SipakMed and Herlev) to verify the effectiveness of the model, highlighting its robustness and adaptability in different scenarios. ### Contribution summary: - Proposed a method for the simultaneous segmentation of multiple nuclei and cytoplasm. - Extracted 26 hand - crafted features, covering the texture, shape and intensity information of nuclei and cytoplasm. - Conducted extensive feature analysis and identified important features. - Constructed a graph data set based on the strongest correlations among features. - Implemented the GCN architecture for classification tasks and achieved high accuracy. - Verified on two different data sets, demonstrating the robustness and adaptability of the model. Through these methods, the paper aims to provide a reliable and efficient early - detection tool for cervical cancer, thereby improving women's survival rate and quality of life.