Cytopathology image analysis method based on high-resolution medical representation learning in medical decision-making system

Baotian Li, Feng Liu, Baolong Lv, Yongjun Zhang, Fangfang Gou, Jia Wu
DOI: https://doi.org/10.1007/s40747-024-01390-7
IF: 6.7
2024-03-04
Complex & Intelligent Systems
Abstract: Artificial intelligence has made substantial progress in many medical application scenarios. The quantity and complexity of pathology images are enormous, but conventional visual screening techniques are labor-intensive, time-consuming, and subject to some degree of subjectivity. Complex pathological data can be converted into mineable image features using artificial intelligence image analysis technology, enabling medical professionals to quickly and quantitatively identify regions of interest and extract information about cellular tissue. In this study, we designed a medical information assistance system for segmenting pathology images and quantifying statistical results, including data enhancement, cell nucleus segmentation, tumor modeling, and quantitative analysis. In cell nucleus segmentation, to address the problem of uneven healthcare resources, we designed a high-precision teacher model (HRMED_T) and a lightweight student model (HRMED_S). The HRMED_T model is based on visual Transformer and high-resolution representation learning. It achieves accurate segmentation by parallel low-resolution convolution and high-scaled image iterative fusion, while also maintaining the high-resolution representation. The HRMED_S model is based on the Channel-wise Knowledge Distillation approach to simplify the structure, achieve faster convergence, and refine the segmentation results by using conditional random fields instead of fully connected structures. The experimental results show that our system has better performance than other methods. The Intersection over Union (IoU) of the HRMED_T model reaches 0.756. The IoU of the HRMED_S model reaches 0.710 with only 3.99 M parameters.
computer science, artificial intelligence
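The abstract reports segmentation quality as Intersection over Union (IoU). As a minimal illustration of how that metric is typically computed for binary nucleus masks, the sketch below counts overlapping and combined foreground pixels. The function name `mask_iou`, the nested-list mask layout, and the convention of returning 1.0 when both masks are empty are assumptions made for this example, not details taken from the paper.

```python
def mask_iou(pred, target):
    """Intersection over Union for two binary masks given as 2D lists of 0/1.

    Hypothetical helper for illustration; masks must have the same shape.
    """
    if len(pred) != len(target):
        raise ValueError("masks must have the same number of rows")
    intersection = 0
    union = 0
    for pred_row, target_row in zip(pred, target):
        if len(pred_row) != len(target_row):
            raise ValueError("mask rows must have the same length")
        for p, t in zip(pred_row, target_row):
            if p and t:
                intersection += 1  # pixel is foreground in both masks
            if p or t:
                union += 1  # pixel is foreground in at least one mask
    # Assumed convention: two empty masks count as a perfect match.
    return intersection / union if union else 1.0


if __name__ == "__main__":
    predicted = [[1, 1, 0],
                 [0, 1, 0]]
    reference = [[1, 0, 0],
                 [0, 1, 1]]
    print(mask_iou(predicted, reference))
```

Under this definition, an IoU of 0.756 means that roughly three quarters of the pixels marked as nucleus by either the prediction or the reference are marked by both.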
What problem does this paper attempt to address?
The paper aims to address several key issues in cytopathology image analysis, particularly the use of high-resolution medical representation learning for cytopathology image analysis in medical decision systems. Specifically, the main objectives of the study include:

1. **Addressing the high cost of pixel-level annotation**: Pixel-level annotation of cytopathology images requires a significant amount of work from professionals, which is especially challenging to achieve in economically underdeveloped regions.
2. **Improving processing efficiency**: The large number of pathological slides and the ultra-high resolution of each image make manual screening and processing a time-consuming and labor-intensive task.
3. **Enhancing model accuracy and generalizability while reducing computational resource requirements**: High-accuracy nuclear segmentation models are often complex in structure and have high computational demands, limiting their application in regions with limited medical resources.
4. **Alleviating the uneven distribution of medical resources**: Many countries have a concentration of medical resources in urban areas while rural areas lack resources, making it difficult for patients in these regions to receive timely diagnoses.

To address the above challenges, the authors designed a medical information assistance system that includes functions such as data augmentation, nuclear segmentation, tumor modeling, and quantitative analysis. The core components of the system include:

- **High-accuracy teacher model (HRMED_T)**: Based on the Vision Transformer and high-resolution representation learning, it allows the Transformer to replace convolutional structures through a cross-window design and employs multi-stream fusion and parallel low-resolution convolution to achieve accurate segmentation while maintaining high-resolution representation.
- **Lightweight student model (HRMED_S)**: Simplified using the Channel-wise Knowledge Distillation method, it employs conditional random fields instead of fully connected layers to refine segmentation results, achieving faster convergence and lower parameter count.

Additionally, the study proposes a series of technical solutions, such as data augmentation processes and a unified focal loss function for imbalanced classes, to improve system performance. Experimental results show that the proposed system performs better than other methods, particularly in the task of nuclear segmentation.
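The summary names Channel-wise Knowledge Distillation as the way the student learns from the teacher but does not spell out the loss. The sketch below follows the commonly cited formulation of that technique: each channel's activations are turned into a probability distribution over spatial positions with a temperature-scaled softmax, and the student is penalized by the KL divergence from the teacher's distribution, averaged over channels and scaled by the squared temperature. The function names, the flat-list tensor layout, and the default temperature of 4.0 are assumptions for illustration, and the paper's actual training setup may differ.

```python
import math


def spatial_log_softmax(channel, temperature):
    """Log-softmax over all spatial positions of one channel (a flat list)."""
    scaled = [value / temperature for value in channel]
    peak = max(scaled)  # subtract the max for numerical stability
    log_norm = math.log(sum(math.exp(v - peak) for v in scaled))
    return [v - peak - log_norm for v in scaled]


def channel_wise_kd_loss(teacher, student, temperature=4.0):
    """Channel-wise distillation loss between teacher and student activations.

    teacher, student: lists of channels, each channel a flat list of
    activations over the same spatial positions (hypothetical layout).
    """
    if len(teacher) != len(student):
        raise ValueError("teacher and student need the same channel count")
    total = 0.0
    for teacher_channel, student_channel in zip(teacher, student):
        if len(teacher_channel) != len(student_channel):
            raise ValueError("channels must cover the same positions")
        log_p_teacher = spatial_log_softmax(teacher_channel, temperature)
        log_p_student = spatial_log_softmax(student_channel, temperature)
        # KL(teacher || student) over the spatial positions of this channel.
        total += sum(
            math.exp(lt) * (lt - ls)
            for lt, ls in zip(log_p_teacher, log_p_student)
        )
    return temperature ** 2 * total / len(teacher)


if __name__ == "__main__":
    teacher_maps = [[1.0, 2.0, 3.0], [0.5, 0.5, 4.0]]
    student_maps = [[3.0, 2.0, 1.0], [0.5, 1.5, 2.0]]
    print(channel_wise_kd_loss(teacher_maps, student_maps))
```

Because the softmax runs over spatial positions within each channel, the loss compares where each channel concentrates its response and ignores any constant offset in activation scale. This is one reason the approach is often considered a good fit for dense prediction tasks such as segmentation.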