From Pixel to Slide image: Polarization Modality-based Pathological Diagnosis Using Representation Learning

Jia Dong,Yao Yao,Yang Dong,Hui Ma
2024-01-03
Abstract:Thyroid cancer is the most common endocrine malignancy, and accurately distinguishing between benign and malignant thyroid tumors is crucial for developing effective treatment plans in clinical practice. Pathologically, thyroid tumors pose diagnostic challenges due to improper specimen sampling. In this study, we have designed a three-stage model using representation learning to integrate pixel-level and slice-level annotations for distinguishing thyroid tumors. This structure includes a pathology structure recognition method to predict structures related to thyroid tumors, an encoder-decoder network to extract pixel-level annotation information by learning the feature representations of image blocks, and an attention-based learning mechanism for the final classification task. This mechanism learns the importance of different image blocks in a pathological region, globally considering the information from each block. In the third stage, all information from the image blocks in a region is aggregated using attention mechanisms, followed by classification to determine the category of the region. Experimental results demonstrate that our proposed method can predict microscopic structures more accurately. After color-coding, the method achieves results on unstained pathology slides that approximate the quality of Hematoxylin and eosin staining, reducing the need for stained pathology slides. Furthermore, by leveraging the concept of indirect measurement and extracting polarized features from structures correlated with lesions, the proposed method can also classify samples where membrane structures cannot be obtained through sampling, providing a potential objective and highly accurate indirect diagnostic technique for thyroid tumors.
Image and Video Processing,Artificial Intelligence,Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
This paper attempts to solve the problem of benign and malignant classification of thyroid tumors, especially the diagnostic challenges in pathological diagnosis due to improper sample collection. Specifically, the article proposes a three - stage model to more accurately distinguish between benign and malignant thyroid tumors by integrating pixel - level and slice - level annotation information through representation learning. ### Problem Background Thyroid cancer is the most common endocrine malignant tumor. Accurately distinguishing between benign and malignant thyroid tumors is crucial for formulating effective treatment plans. However, in pathology, the diagnosis of thyroid tumors faces challenges such as improper sample collection. Therefore, developing a method that can improve the diagnostic accuracy is of great significance. ### The Method Proposed in the Paper To address the above problems, this paper designs a three - stage model: 1. **Pathological Structure Recognition**: Use the pathological structure recognition method by predicting structures related to thyroid tumors (such as cell, fiber, and colloid structures). 2. **Encoder - Decoder Network**: Extract pixel - level annotation information by learning the feature representations of image patches. This stage aims to connect annotation information at different levels and generate probability maps for structure classification. 3. **Attention - mechanism - based Learning**: In the third stage, use the attention mechanism to aggregate all the image patch information within the region for the final classification task. This mechanism learns the importance of different image patches in the entire pathological region and globally considers the information of each patch. ### Innovation Points - **Application of Polarization Modes**: Utilize polarization imaging technology to extract polarization features from lesion - related structures, providing an indirect measurement method. - **Virtual Staining**: Through color coding, this method can achieve an effect similar to hematoxylin and eosin (H&E) staining on unstained pathological slices, reducing the need for stained pathological slices. - **Multi - scale Information Fusion**: Combine pixel - level and slice - level annotations to improve the accuracy of classification. ### Experimental Results The experimental results show that the proposed model is more accurate in predicting microstructures. In addition, by introducing noise and verifying the robustness of the model, the effectiveness and stability of this method are proved. ### Summary This paper proposes a new thyroid tumor classification method by introducing polarization modes and representation learning, which not only improves the diagnostic accuracy but also provides a more objective and efficient diagnostic tool for the clinic.