Abstract:The application of medical image segmentation technology facilitates precise tissue localization, aiding doctors in accurate diagnoses. To address limitations of current methods, we propose a Semi‐supervised Contextual Cognitive Augmentation‐based Cross‐teaching Network. This network incorporates a Contextual Cognitive Enhancement Module, employing data augmentation techniques and information extraction mechanisms to improve segmentation accuracy. By utilizing a cross‐teaching strategy and hybrid loss function, our approach encourages knowledge sharing between networks, leading to substantial improvements in multiclass medical image segmentation over existing single‐framework networks, as demonstrated in experimental results. The application of medical image segmentation technology enables accurate localization of human tissues, providing doctors with a reliable foundation for diagnosis. While deep learning methods have proven effective in this task, most current approaches rely on a single prediction framework, which overlooks Edge semantic features and results in flawed texture features. Moreover, existing supervised methods face challenges due to limited availability of high‐quality annotations in the field of medical imaging. In this article, a Semi‐supervised Contextual Cognitive Augmentation‐based Cross‐teaching Network is proposed. A Contextual Cognitive Enhancement Module is introduced consisting of two components: data augmentation and information extraction. The data augmentation component provides multi‐level data distribution by incorporating diverse perturbation strategies such as Discrete Cosine Transform and Gaussian noise. The information extraction component employs the Comprehensive Information Extraction module, which consists of Global Perception Information Extraction module and Multi‐channel Information Extraction module to extract perceptual information from images and enhance interaction between image channels, respectively. Additionally, a cross‐teaching strategy is adopted and a hybrid loss function is utilized to encourage knowledge sharing among the networks, leveraging the advantages of dual networks for improved performance. Experimental results demonstrate significant enhancements in multiclass medical image segmentation compared to several state‐of‐the‐art single‐framework networks.

Text-Guided Neural Network Training for Image Recognition in Natural Scenes and Medicine

Enhancing medical text detection with vision-language pre-training and efficient segmentation

TandemNet: Distilling Knowledge from Medical Images Using Diagnostic Reports as Optional Semantic References

When CNN Meet with ViT: Towards Semi-Supervised Learning for Multi-Class Medical Image Semantic Segmentation

Efficient Neural Network for Text Recognition in Natural Scenes Based on End-to-End Multi-Scale Attention Mechanism

Artificial Convolutional Neural Network in Object Detection and Semantic Segmentation for Medical Imaging Analysis

Text-Attentional Convolutional Neural Networks for Scene Text Detection

Text-Attentional Convolutional Neural Network for Scene Text Detection

A comprehensive survey on convolutional neural network in medical image analysis

MDNet: A Semantically and Visually Interpretable Medical Image Diagnosis Network

An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition

A Text-Context-Aware CNN Network for Multi-oriented and Multi-language Scene Text Detection.

CM-SegNet: A Deep Learning-Based Automatic Segmentation Approach for Medical Images by Combining Convolution and Multilayer Perceptron

Semi‐supervised contextual cognitive augmentation‐based cross‐teaching network for multiclass medical image segmentation

A Novel Global Spatial Attention Mechanism in Convolutional Neural Network for Medical Image Classification

A Convolutional Recurrent Neural-Network-Based Machine Learning for Scene Text Recognition Application

Sam-Guided Enhanced Fine-Grained Encoding with Mixed Semantic Learning for Medical Image Captioning

Integrating Scene Text and Visual Appearance for Fine-Grained Image Classification

CCNNet: a novel lightweight convolutional neural network and its application in traditional Chinese medicine recognition

CiT-Net: Convolutional Neural Networks Hand in Hand with Vision Transformers for Medical Image Segmentation

A Synergic Neural Network for Medical Image Classification Based on Attention Mechanism