Peanut lectin histochemistry of 120 mammary carcinomas and its relation to tumor type, grading, staging, and receptor status

W. Böcker,A. Klaubert,J. Bahnsen,G. Schweikhart,K. Pollow,M. Mitze,R. Kreienberg,T. Beck,H. Stegner

DOI: https://doi.org/10.1007/BF00695231

Virchows Archiv

Abstract:

What problem does this paper attempt to address?

Diffusion model-based text-guided enhancement network for medical image segmentation

Zhiwei Dong,Genji Yuan,Zhen Hua,Jinjiang Li

DOI: https://doi.org/10.1016/j.eswa.2024.123549

IF: 8.5

2024-09-01

Expert Systems with Applications

Abstract:In recent years, denoising diffusion models have achieved remarkable success in generating pixel-level representations with semantic values for image generation modeling. In this study, we propose a novel end-to-end framework, called TGEDiff, focusing on medical image segmentation. TGEDiff fuses a textual attention mechanism with the diffusion model by introducing an additional auxiliary categorization task to guide the diffusion model with textual information to generate excellent pixel-level representations. To overcome the limitation of limited perceptual fields for independent feature encoders within the diffusion model, we introduce a multi-kernel excitation module to extend the model’s perceptual capability. Meanwhile, a guided feature enhancement module is introduced in Denoising-UNet to focus the model’s attention on important regions and attenuate the influence of noise and irrelevant background in medical images. We critically evaluated TGEDiff on three datasets (Kvasir-SEG, Kvasir-Sessile, and GLaS), and TGEDiff achieved significant improvements over the state-of-the-art approach on all three datasets, with F1 scores and mIoU improving by 0.88% and 1.09%, 3.21% and 3.43%, respectively, 1.29% and 2.34%. These data validate that TGEDiff has excellent performance in medical image segmentation. TGEDiff is expected to facilitate accurate diagnosis and treatment of medical diseases through more precise deconvolutional structural segmentation.

computer science, artificial intelligence,engineering, electrical & electronic,operations research & management science
Enhancing Label-efficient Medical Image Segmentation with Text-guided Diffusion Models

Chun-Mei Feng

2024-07-07

Abstract:Aside from offering state-of-the-art performance in medical image generation, denoising diffusion probabilistic models (DPM) can also serve as a representation learner to capture semantic information and potentially be used as an image representation for downstream tasks, e.g., segmentation. However, these latent semantic representations rely heavily on labor-intensive pixel-level annotations as supervision, limiting the usability of DPM in medical image segmentation. To address this limitation, we propose an enhanced diffusion segmentation model, called TextDiff, that improves semantic representation through inexpensive medical text annotations, thereby explicitly establishing semantic representation and language correspondence for diffusion models. Concretely, TextDiff extracts intermediate activations of the Markov step of the reverse diffusion process in a pretrained diffusion model on large-scale natural images and learns additional expert knowledge by combining them with complementary and readily available diagnostic text information. TextDiff freezes the dual-branch multi-modal structure and mines the latent alignment of semantic features in diffusion models with diagnostic descriptions by only training the cross-attention mechanism and pixel classifier, making it possible to enhance semantic representation with inexpensive text. Extensive experiments on public QaTa-COVID19 and MoNuSeg datasets show that our TextDiff is significantly superior to the state-of-the-art multi-modal segmentation methods with only a few training samples.

Computer Vision and Pattern Recognition
HiDiff: Hybrid Diffusion Framework for Medical Image Segmentation

Tao Chen,Chenhui Wang,Zhihao Chen,Yiming Lei,Hongming Shan

2024-07-04

Abstract:Medical image segmentation has been significantly advanced with the rapid development of deep learning (DL) techniques. Existing DL-based segmentation models are typically discriminative; i.e., they aim to learn a mapping from the input image to segmentation masks. However, these discriminative methods neglect the underlying data distribution and intrinsic class characteristics, suffering from unstable feature space. In this work, we propose to complement discriminative segmentation methods with the knowledge of underlying data distribution from generative models. To that end, we propose a novel hybrid diffusion framework for medical image segmentation, termed HiDiff, which can synergize the strengths of existing discriminative segmentation models and new generative diffusion models. HiDiff comprises two key components: discriminative segmentor and diffusion refiner. First, we utilize any conventional trained segmentation models as discriminative segmentor, which can provide a segmentation mask prior for diffusion refiner. Second, we propose a novel binary Bernoulli diffusion model (BBDM) as the diffusion refiner, which can effectively, efficiently, and interactively refine the segmentation mask by modeling the underlying data distribution. Third, we train the segmentor and BBDM in an alternate-collaborative manner to mutually boost each other. Extensive experimental results on abdomen organ, brain tumor, polyps, and retinal vessels segmentation datasets, covering four widely-used modalities, demonstrate the superior performance of HiDiff over existing medical segmentation algorithms, including the state-of-the-art transformer- and diffusion-based ones. In addition, HiDiff excels at segmenting small objects and generalizing to new datasets. Source codes are made available at <a class="link-external link-https" href="https://github.com/takimailto/HiDiff" rel="external noopener nofollow">this https URL</a>.

Computer Vision and Pattern Recognition
Enhancing Medical Image Segmentation with Deep Learning and Diffusion Models

Houze Liu,Tong Zhou,Yanlin Xiang,Aoran Shen,Jiacheng Hu,Junliang Du

2024-11-22

Abstract:Medical image segmentation is crucial for accurate clinical diagnoses, yet it faces challenges such as low contrast between lesions and normal tissues, unclear boundaries, and high variability across patients. Deep learning has improved segmentation accuracy and efficiency, but it still relies heavily on expert annotations and struggles with the complexities of medical images. The small size of medical image datasets and the high cost of data acquisition further limit the performance of segmentation networks. Diffusion models, with their iterative denoising process, offer a promising alternative for better detail capture in segmentation. However, they face difficulties in accurately segmenting small targets and maintaining the precision of boundary details. This article discusses the importance of medical image segmentation, the limitations of current deep learning approaches, and the potential of diffusion models to address these challenges.

Image and Video Processing,Computer Vision and Pattern Recognition,Machine Learning
FDiff-Fusion:Denoising diffusion fusion network based on fuzzy learning for 3D medical image segmentation

Weiping Ding,Sheng Geng,Haipeng Wang,Jiashuang Huang,Tianyi Zhou

DOI: https://doi.org/10.1016/J.INFFUS.2024.102540

2024-07-22

Abstract:In recent years, the denoising diffusion model has achieved remarkable success in image segmentation modeling. With its powerful nonlinear modeling capabilities and superior generalization performance, denoising diffusion models have gradually been applied to medical image segmentation tasks, bringing new perspectives and methods to this field. However, existing methods overlook the uncertainty of segmentation boundaries and the fuzziness of regions, resulting in the instability and inaccuracy of the segmentation results. To solve this problem, a denoising diffusion fusion network based on fuzzy learning for 3D medical image segmentation (FDiff-Fusion) is proposed in this paper. By integrating the denoising diffusion model into the classical U-Net network, this model can effectively extract rich semantic information from input medical images, thus providing excellent pixel-level representation for medical image segmentation. ... Finally, to validate the effectiveness of FDiff-Fusion, we compare it with existing advanced segmentation networks on the BRATS 2020 brain tumor dataset and the BTCV abdominal multi-organ dataset. The results show that FDiff-Fusion significantly improves the Dice scores and HD95 distance on these two datasets, demonstrating its superiority in medical image segmentation tasks.

Computer Vision and Pattern Recognition
Diff-UNet: A Diffusion Embedded Network for Volumetric Segmentation

Zhaohu Xing,Liang Wan,Huazhu Fu,Guang Yang,Lei Zhu

2023-03-18

Abstract:In recent years, Denoising Diffusion Models have demonstrated remarkable success in generating semantically valuable pixel-wise representations for image generative modeling. In this study, we propose a novel end-to-end framework, called Diff-UNet, for medical volumetric segmentation. Our approach integrates the diffusion model into a standard U-shaped architecture to extract semantic information from the input volume effectively, resulting in excellent pixel-level representations for medical volumetric segmentation. To enhance the robustness of the diffusion model's prediction results, we also introduce a Step-Uncertainty based Fusion (SUF) module during inference to combine the outputs of the diffusion models at each step. We evaluate our method on three datasets, including multimodal brain tumors in MRI, liver tumors, and multi-organ CT volumes, and demonstrate that Diff-UNet outperforms other state-of-the-art methods significantly. Our experimental results also indicate the universality and effectiveness of the proposed model. The proposed framework has the potential to facilitate the accurate diagnosis and treatment of medical conditions by enabling more precise segmentation of anatomical structures. The codes of Diff-UNet are available at <a class="link-external link-https" href="https://github.com/ge-xing/Diff-UNet" rel="external noopener nofollow">this https URL</a>

Image and Video Processing,Computer Vision and Pattern Recognition
Aspirin in the prevention of cardiovascular disease in women.

J. Dalen

DOI: https://doi.org/10.1056/nejm200506303522617

IF: 158.5

New England Journal of Medicine

Abstract:
UNet based on dynamic convolution decomposition and triplet attention

Yang Li,Bobo Yan,Jianxin Hou,Bingyang Bai,Xiaoyu Huang,Canfei Xu,Limei Fang

DOI: https://doi.org/10.1038/s41598-023-50989-2

IF: 4.6

2024-01-02

Scientific Reports

Abstract:Abstract The robustness and generalization of medical image segmentation models are being challenged by the differences between different disease types, different image types, and different cases.Deep learning based semantic segmentation methods have been providing state-of-the-art performance in the last few years. One deep learning technique, U-Net, has become the most popular architecture in the medical imaging segmentation. Despite outstanding overall performance in segmenting medical images, it still has the problems of limited feature expression ability and inaccurate segmentation. To this end, we propose a DTA-UNet based on Dynamic Convolution Decomposition (DCD) and Triple Attention (TA). Firstly, the model with Attention U-Net as the baseline network uses DCD to replace all the conventional convolution in the encoding-decoding process to enhance its feature extraction capability. Secondly, we combine TA with Attention Gate (AG) to be used for skip connection in order to highlight lesion regions by removing redundant information in both spatial and channel dimensions. The proposed model are tested on the two public datasets and actual clinical dataset such as the public COVID-SemiSeg dataset, the ISIC 2018 dataset, and the cooperative hospital stroke segmentation dataset. Ablation experiments on the clinical stroke segmentation dataset show the effectiveness of DCD and TA with only a 0.7628 M increase in the number of parameters compared to the baseline model. The proposed DTA-UNet is further evaluated on the three datasets of different types of images to verify its universality. Extensive experimental results show superior performance on different segmentation metrics compared to eight state-of-art methods.The GitHub URL of our code is https://github.com/shuaihou1234/DTA-UNet .

multidisciplinary sciences
Localization of hemoptysis in patients with cystic fibrosis.

C. Ores,D. Baker

DOI: https://doi.org/10.1164/ARRD.1969.99.5.790

1969-05-01

American Review of Respiratory Disease

Abstract:
FDiff-Fusion: Denoising Diffusion Fusion Network Based on Fuzzy Learning for 3D Medical Image Segmentation

Weiping Ding,Sheng Geng,Haipeng Wang,Jiashuang Huang,Tianyi Zhou

DOI: https://doi.org/10.1016/j.inffus.2024.102540

IF: 18.6

2024-01-01

Information Fusion

Abstract:In recent years, the denoising diffusion model has achieved remarkable success in image segmentation modeling. With its powerful nonlinear modeling capabilities and superior generalization performance, denoising diffusion models have gradually been applied to medical image segmentation tasks, bringing new perspectives and methods to this field. However, existing methods overlook the uncertainty of segmentation boundaries and the fuzziness of regions, resulting in the instability and inaccuracy of the segmentation results. To solve this problem, a denoising diffusion fusion network based on fuzzy learning for 3D medical image segmentation (FDiff-Fusion) is proposed in this paper. By integrating the denoising diffusion model into the classical U-Net network, this model can effectively extract rich semantic information from input medical images, thus providing excellent pixel-level representation for medical image segmentation. In this paper, a fuzzy learning module is designed on the skip path of U-Net network because of the widespread boundary uncertainty and region blurring of medical image segmentation. The module sets several fuzzy membership functions for the input encoded features to describe the similarity degree between the feature points, and applies fuzzy rules to the fuzzy membership functions, thus enhancing the modeling ability of the model for uncertain boundaries and fuzzy regions. In addition, in order to improve the accuracy and robustness of the model segmentation results, we introduced an iterative attention feature fusion method in the test phase, which added local context information to the global context information in the attention module to fuse the prediction results of each denoising time step. Finally, to validate the effectiveness of FDiff-Fusion, we compare it with existing advanced segmentation networks on the BRATS 2020 brain tumor dataset and the BTCV abdominal multi-organ dataset. The results show that FDiff-Fusion significantly improves the Dice scores and HD95 distance on these two datasets, demonstrating its superiority in medical image segmentation tasks.
Advancing Medical Image Segmentation: Morphology-Driven Learning with Diffusion Transformer

Sungmin Kang,Jaeha Song,Jihie Kim

2024-09-01

Abstract:Understanding the morphological structure of medical images and precisely segmenting the region of interest or abnormality is an important task that can assist in diagnosis. However, the unique properties of medical imaging make clear segmentation difficult,and the high cost and time-consuming task of labeling leads to a coarse-grained representation of ground truth. Facing with these problems, we propose a novel Diffusion Transformer Segmentation (DTS) model for robust segmentation in the presence of noise. We propose an alternative to the dominant Denoising U-Net encoder through experiments applying a transformer architecture, which captures global dependency through self-attention. Additionally, we propose k-neighbor label smoothing, reverse boundary attention, and self-supervised learning with morphology-driven learning to improve the ability to identify complex structures. Our model, which analyzes the morphological representation of images, shows better results than the previous models in various medical imaging modalities, including CT, MRI, and lesion images.

Computer Vision and Pattern Recognition,Artificial Intelligence
Segmentation of medical images using an attention embedded lightweight network

Junde Chen,Weirong Chen,Adan Zeb,Defu Zhang

DOI: https://doi.org/10.1016/j.engappai.2022.105416

IF: 8

2022-11-01

Engineering Applications of Artificial Intelligence

Abstract:Accurate segmentation of computerized tomography (CT) images is of great significance to clinical diagnosis. However, because of the high similarity of gray values, it is a challenging task for CT image segmentation. The encoder and decoder based CNN architecture has greatly improved the segmentation effect, but it also encounters a bottleneck due to the information loss in the encoding process. In view of this, we proposed an image segmentation model based on a novel network architecture for medical image segmentation. To improve the efficiency and decrease the number of model parameters, we optimized the Inception module by substituting the depth-wise separable convolutions (DWSC) for the standard convolutions. Then, the optimized Inception module paired with the residual network was chosen as the backbone extractor to extract high-quality image features. Further, a hybrid attention mechanism, which consists of channel-wise and spatial attention, was incorporated into the network to realize the maximum reuse of inter-channel relationships and spatial point characteristics. In particular, the attention module was separately embedded into the contracting and expansive paths to enhance the feature extraction capability and detail restoration effects. The experimental indicators were significantly improved on the test dataset, and the intersection over union (IoU) of the proposed method reached no less than 0.9645, 0.6499, and 0.7945 on the Lung, Colon tumor, and DRIVE datasets, respectively, which demonstrated the effectiveness of the proposed method. Our code and data are available at https://github.com/xtu502/medical-image-segmentation/.

automation & control systems,computer science, artificial intelligence,engineering, electrical & electronic, multidisciplinary
BerDiff: Conditional Bernoulli Diffusion Model for Medical Image Segmentation

Tao Chen,Chenhui Wang,Hongming Shan

DOI: https://doi.org/10.1007/978-3-031-43901-8_47

2023-04-10

Abstract:Medical image segmentation is a challenging task with inherent ambiguity and high uncertainty, attributed to factors such as unclear tumor boundaries and multiple plausible annotations. The accuracy and diversity of segmentation masks are both crucial for providing valuable references to radiologists in clinical practice. While existing diffusion models have shown strong capacities in various visual generation tasks, it is still challenging to deal with discrete masks in segmentation. To achieve accurate and diverse medical image segmentation masks, we propose a novel conditional Bernoulli Diffusion model for medical image segmentation (BerDiff). Instead of using the Gaussian noise, we first propose to use the Bernoulli noise as the diffusion kernel to enhance the capacity of the diffusion model for binary segmentation tasks, resulting in more accurate segmentation masks. Second, by leveraging the stochastic nature of the diffusion model, our BerDiff randomly samples the initial Bernoulli noise and intermediate latent variables multiple times to produce a range of diverse segmentation masks, which can highlight salient regions of interest that can serve as valuable references for radiologists. In addition, our BerDiff can efficiently sample sub-sequences from the overall trajectory of the reverse diffusion, thereby speeding up the segmentation process. Extensive experimental results on two medical image segmentation datasets with different modalities demonstrate that our BerDiff outperforms other recently published state-of-the-art methods. Our results suggest diffusion models could serve as a strong backbone for medical image segmentation.

Computer Vision and Pattern Recognition,Artificial Intelligence
MTANet: Multi-Task Attention Network for Automatic Medical Image Segmentation and Classification

Jie Yu,Wenli Dai,Dexing Kong,Yuling Wang,Yating Ling,Ping Liang

DOI: https://doi.org/10.1109/TMI.2023.3317088

IF: 10.6

2023-09-19

IEEE Transactions on Medical Imaging

Abstract:Medical image segmentation and classification are two of the most key steps in computer-aided clinical diagnosis. The region of interest were usually segmented in a proper manner to extract useful features for further disease classification. However, these methods are computationally complex and time-consuming. In this paper, we proposed a one-stage multi-task attention network (MTANet) which efficiently classifies objects in an image while generating a high-quality segmentation mask for each medical object. A reverse addition attention module was designed in the segmentation task to fusion areas in global map and boundary cues in high-resolution features, and an attention bottleneck module was used in the classification task for image feature and clinical feature fusion. We evaluated the performance of MTANet with CNN-based and transformer-based architectures across three imaging modalities for different tasks: CVC-ClinicDB dataset for polyp segmentation, ISIC-2018 dataset for skin lesion segmentation, and our private ultrasound dataset for liver tumor segmentation and classification. Our proposed model outperformed state-of-the-art models on all three datasets and was superior to all 25 radiologists for liver tumor diagnosis.

Computer Science,Medicine
TA-Net: Triple attention network for medical image segmentation

Yang Li,Jun Yang,Jiajia Ni,Ahmed Elazab,Jianhuang Wu

DOI: https://doi.org/10.1016/j.compbiomed.2021.104836

IF: 7.7

2021-10-01

Computers in Biology and Medicine

Abstract:The automatic segmentation of medical images has made continuous progress due to the development of convolutional neural networks (CNNs) and attention mechanism. However, previous works usually explore the attention features of a certain dimension in the image, thus may ignore the correlation between feature maps in other dimensions. Therefore, how to capture the global features of various dimensions is still facing challenges. To deal with this problem, we propose a triple attention network (TA-Net) by exploring the ability of the attention mechanism to simultaneously recognize global contextual information in the channel domain, spatial domain, and feature internal domain. Specifically, during the encoder step, we propose a channel with self-attention encoder (CSE) block to learn the long-range dependencies of pixels. The CSE effectively increases the receptive field and enhances the representation of target features. In the decoder step, we propose a spatial attention up-sampling (SU) block that makes the network pay more attention to the position of the useful pixels when fusing the low-level and high-level features. Extensive experiments were tested on four public datasets and one local dataset. The datasets include the following types: retinal blood vessels (DRIVE and STARE), cells (ISBI 2012), cutaneous melanoma (ISIC 2017), and intracranial blood vessels. Experimental results demonstrate that the proposed TA-Net is overall superior to previous state-of-the-art methods in different medical image segmentation tasks with high accuracy, promising robustness, and relatively low redundancy.

engineering, biomedical,computer science, interdisciplinary applications,mathematical & computational biology,biology
TransDiffSeg: Transformer-Based Conditional Diffusion Segmentation Model for Abdominal Multi-Objective

WenWen Gu,GuoDong Zhang,RongHui Ju,SuRan Wang,YanLin Li,TingYu Liang,Wei Guo,ZhaoXuan Gong

DOI: https://doi.org/10.1007/s10278-024-01206-7

2024-07-29

Abstract:In the domain of medical image segmentation, traditional diffusion probabilistic models are hindered by local inductive biases stemming from convolutional operations, constraining their ability to model long-term dependencies and leading to inaccurate mask generation. Conversely, Transformer offers a remedy by obviating the local inductive biases inherent in convolutional operations, thereby enhancing segmentation precision. Currently, the integration of Transformer and convolution operations mainly occurs in two forms: nesting and stacking. However, both methods address the bias elimination at a relatively large granularity, failing to fully leverage the advantages of both approaches. To address this, this paper proposes a conditional diffusion segmentation model named TransDiffSeg, which combines Transformer with convolution operations from traditional diffusion models in a parallel manner. This approach eliminates the accumulated local inductive bias of convolution operations at a finer granularity within each layer. Additionally, an adaptive feature fusion block is employed to merge conditional semantic features and noise features, enhancing global semantic information and reducing the Transformer's sensitivity to noise features. To validate the impact of granularity in bias elimination on performance and the impact of Transformer in alleviating the accumulated local inductive biases of convolutional operations in diffusion probabilistic models, experiments are conducted on the AMOS22 dataset and BTCV dataset. Experimental results demonstrate that eliminating local inductive bias at a finer granularity significantly improves the segmentation performance of diffusion probabilistic models. Furthermore, the results confirm that the finer the granularity of bias elimination, the better the segmentation performance.
Verdiff-Net: A Conditional Diffusion Framework for Spinal Medical Image Segmentation

Zhiqing Zhang,Tianyong Liu,Guojia Fan,Yao Pu,Bin Li,Xingyu Chen,Qianjin Feng,Shoujun Zhou

DOI: https://doi.org/10.3390/bioengineering11101031

IF: 5.046

2024-10-15

Bioengineering

Abstract:Spinal medical image segmentation is critical for diagnosing and treating spinal disorders. However, ambiguity in anatomical boundaries and interfering factors in medical images often cause segmentation errors. Current deep learning models cannot fully capture the intrinsic data properties, leading to unstable feature spaces. To tackle the above problems, we propose Verdiff-Net, a novel diffusion-based segmentation framework designed to improve segmentation accuracy and stability by learning the underlying data distribution. Verdiff-Net integrates a multi-scale fusion module (MSFM) for fine feature extraction and a noise semantic adapter (NSA) to refine segmentation masks. Validated across four multi-modality spinal datasets, Verdiff-Net achieves a high Dice coefficient of 93%, demonstrating its potential for clinical applications in precision spinal surgery.
TransDiff: medical image segmentation method based on Swin Transformer with diffusion probabilistic model

Xiaoxiao Liu,Yan Zhao,Shigang Wang,Jian Wei

DOI: https://doi.org/10.1007/s10489-024-05496-w

IF: 5.3

2024-05-19

Applied Intelligence

Abstract:Medical image segmentation can provide a reliable basis for clinical analysis and diagnosis. However, this task is challenging due to the low contrast, boundary ambiguity between organs or lesions and surrounding tissues, and noise interference of images. To address this challenge, which is unique to medical images, and further improve the segmentation accuracy and precision, a medical image segmentation model (TransDiff) is proposed from the perspective of improving model robustness and enriching semantic information. TransDiff comprises three parts: a variational autoencoder (VAE), a diffusion transformer model and a Swin Transformer. The VAE constructs a latent space to provide an environment for fully extracting and fusing features. The diffusion model predicts and removes noise by inferring semantics through the propagation of information between nodes. The Swin Transformer enriches discriminative features as a conditional part. TransDiff inherits the robustness to noise and missing data of the diffusion model and the feature enrichment of the Swin Transformer, thus exhibiting a higher understanding of semantic information. It performs well on medical datasets with three different image modalities, outperforms existing medical image segmentation methods in terms of segmentation precision and accuracy, and has good generalizability. The codes and trained models will be publicly available at https://github.com/xiaoxiao1997/TransDiff.

computer science, artificial intelligence
Introducing Shape Prior Module in Diffusion Model for Medical Image Segmentation

Zhiqing Zhang,Guojia Fan,Tianyong Liu,Nan Li,Yuyang Liu,Ziyu Liu,Canwei Dong,Shoujun Zhou

2023-09-12

Abstract:Medical image segmentation is critical for diagnosing and treating spinal disorders. However, the presence of high noise, ambiguity, and uncertainty makes this task highly challenging. Factors such as unclear anatomical boundaries, inter-class similarities, and irrational annotations contribute to this challenge. Achieving both accurate and diverse segmentation templates is essential to support radiologists in clinical practice. In recent years, denoising diffusion probabilistic modeling (DDPM) has emerged as a prominent research topic in computer vision. It has demonstrated effectiveness in various vision tasks, including image deblurring, super-resolution, anomaly detection, and even semantic representation generation at the pixel level. Despite the robustness of existing diffusion models in visual generation tasks, they still struggle with discrete masks and their various effects. To address the need for accurate and diverse spine medical image segmentation templates, we propose an end-to-end framework called VerseDiff-UNet, which leverages the denoising diffusion probabilistic model (DDPM). Our approach integrates the diffusion model into a standard U-shaped architecture. At each step, we combine the noise-added image with the labeled mask to guide the diffusion direction accurately towards the target region. Furthermore, to capture specific anatomical a priori information in medical images, we incorporate a shape a priori module. This module efficiently extracts structural semantic information from the input spine images. We evaluate our method on a single dataset of spine images acquired through X-ray imaging. Our results demonstrate that VerseDiff-UNet significantly outperforms other state-of-the-art methods in terms of accuracy while preserving the natural features and variations of anatomy.

Image and Video Processing,Computer Vision and Pattern Recognition
TGDAUNet: Transformer and GCNN based dual-branch attention UNet for medical image segmentation

Pengfei Song,Jinjiang Li,Hui Fan,Linwei Fan

DOI: https://doi.org/10.1016/j.compbiomed.2023.107583

Abstract:Accurate and automatic segmentation of medical images is a key step in clinical diagnosis and analysis. Currently, the successful application of Transformers' model in the field of computer vision, researchers have begun to gradually explore the application of Transformers in medical segmentation of images, especially in combination with convolutional neural networks with coding-decoding structure, which have achieved remarkable results in the field of medical segmentation. However, most studies have combined Transformers with CNNs at a single scale or processed only the highest-level semantic feature information, ignoring the rich location information in the lower-level semantic feature information. At the same time, for problems such as blurred structural boundaries and heterogeneous textures in images, most existing methods usually simply connect contour information to capture the boundaries of the target. However, these methods cannot capture the precise outline of the target and ignore the potential relationship between the boundary and the region. In this paper, we propose the TGDAUNet, which consists of a dual-branch backbone network of CNNs and Transformers and a parallel attention mechanism, to achieve accurate segmentation of lesions in medical images. Firstly, high-level semantic feature information of the CNN backbone branches is fused at multiple scales, and the high-level and low-level feature information complement each other's location and spatial information. We further use the polarised self-attentive (PSA) module to reduce the impact of redundant information caused by multiple scales, to better couple with the feature information extracted from the Transformers backbone branch, and to establish global contextual long-range dependencies at multiple scales. In addition, we have designed the Reverse Graph-reasoned Fusion (RGF) module and the Feature Aggregation (FA) module to jointly guide the global context. The FA module aggregates high-level semantic feature information to generate an original global predictive segmentation map. The RGF module captures non-significant features of the boundaries in the original or secondary global prediction segmentation graph through a reverse attention mechanism, establishing a graph reasoning module to explore the potential semantic relationships between boundaries and regions, further refining the target boundaries. Finally, to validate the effectiveness of our proposed method, we compare our proposed method with the current popular methods in the CVC-ClinicDB, Kvasir-SEG, ETIS, CVC-ColonDB, CVC-300,datasets as well as the skin cancer segmentation datasets ISIC-2016 and ISIC-2017. The large number of experimental results show that our method outperforms the currently popular methods. Source code is released at https://github.com/sd-spf/TGDAUNet.

Peanut lectin histochemistry of 120 mammary carcinomas and its relation to tumor type, grading, staging, and receptor status

Diffusion model-based text-guided enhancement network for medical image segmentation

Enhancing Label-efficient Medical Image Segmentation with Text-guided Diffusion Models

HiDiff: Hybrid Diffusion Framework for Medical Image Segmentation

Enhancing Medical Image Segmentation with Deep Learning and Diffusion Models

FDiff-Fusion:Denoising diffusion fusion network based on fuzzy learning for 3D medical image segmentation

Diff-UNet: A Diffusion Embedded Network for Volumetric Segmentation

Aspirin in the prevention of cardiovascular disease in women.

UNet based on dynamic convolution decomposition and triplet attention

Localization of hemoptysis in patients with cystic fibrosis.

FDiff-Fusion: Denoising Diffusion Fusion Network Based on Fuzzy Learning for 3D Medical Image Segmentation

Advancing Medical Image Segmentation: Morphology-Driven Learning with Diffusion Transformer

Segmentation of medical images using an attention embedded lightweight network

BerDiff: Conditional Bernoulli Diffusion Model for Medical Image Segmentation

MTANet: Multi-Task Attention Network for Automatic Medical Image Segmentation and Classification

TA-Net: Triple attention network for medical image segmentation

TransDiffSeg: Transformer-Based Conditional Diffusion Segmentation Model for Abdominal Multi-Objective

Verdiff-Net: A Conditional Diffusion Framework for Spinal Medical Image Segmentation

TransDiff: medical image segmentation method based on Swin Transformer with diffusion probabilistic model

Introducing Shape Prior Module in Diffusion Model for Medical Image Segmentation

TGDAUNet: Transformer and GCNN based dual-branch attention UNet for medical image segmentation