Abstract:Background: Deep learning (DL) techniques have been extensively applied in medical image classification. The unique characteristics of medical imaging data present challenges, including small labeled datasets, severely imbalanced class distribution, and significant variations in imaging quality. Recently, generative adversarial network (GAN)-based classification methods have gained attention for their ability to enhance classification accuracy by incorporating realistic GAN-generated images as data augmentation. However, the performance of these GAN-based methods often relies on high-quality generated images, while large amounts of training data are required to train GAN models to achieve optimal performance. Purpose: In this study, we propose an adversarial learning-based classification framework to achieve better classification performance. Innovatively, GAN models are employed as supplementary regularization terms to support classification, aiming to address the challenges described above. Methods: The proposed classification framework, GAN-DL, consists of a feature extraction network (F-Net), a classifier, and two adversarial networks, specifically a reconstruction network (R-Net) and a discriminator network (D-Net). The F-Net extracts features from input images, and the classifier uses these features for classification tasks. R-Net and D-Net have been designed following the GAN architecture. R-Net employs the extracted feature to reconstruct the original images, while D-Net is tasked with the discrimination between the reconstructed image and the original images. An iterative adversarial learning strategy is designed to guide model training by incorporating multiple network-specific loss functions. These loss functions, serving as supplementary regularization, are automatically derived during the reconstruction process and require no additional data annotation. Results: To verify the model's effectiveness, we performed experiments on two datasets, including a COVID-19 dataset with 13 958 chest x-ray images and an oropharyngeal squamous cell carcinoma (OPSCC) dataset with 3255 positron emission tomography images. Thirteen classic DL-based classification methods were implemented on the same datasets for comparison. Performance metrics included precision, sensitivity, specificity, and F 1 $F_1$ -score. In addition, we conducted ablation studies to assess the effects of various factors on model performance, including the network depth of F-Net, training image size, training dataset size, and loss function design. Our method achieved superior performance than all comparative methods. On the COVID-19 dataset, our method achieved 95.4 % ± 0.6 % $95.4\%\pm 0.6\%$ , 95.3 % ± 0.9 % $95.3\%\pm 0.9\%$ , 97.7 % ± 0.4 % $97.7\%\pm 0.4\%$ , and 95.3 % ± 0.9 % $95.3\%\pm 0.9\%$ in terms of precision, sensitivity, specificity, and F 1 $F_1$ -score, respectively. It achieved 96.2 % ± 0.7 % $96.2\%\pm 0.7\%$ across all these metrics on the OPSCC dataset. The study to investigate the effects of two adversarial networks highlights the crucial role of D-Net in improving model performance. Ablation studies further provide an in-depth understanding of our methodology. Conclusion: Our adversarial-based classification framework leverages GAN-based adversarial networks and an iterative adversarial learning strategy to harness supplementary regularization during training. This design significantly enhances classification accuracy and mitigates overfitting issues in medical image datasets. Moreover, its modular design not only demonstrates flexibility but also indicates its potential applicability to various clinical contexts and medical imaging applications.

Dual-Path Adversarial Learning for Fully Convolutional Network (FCN)-Based Medical Image Segmentation

SegAN: Adversarial Network with Multi-scale $L_1$ Loss for Medical Image Segmentation

Stacked fully convolutional networks with multi-channel learning: application to medical image segmentation

DCFNet: An Effective Dual-Branch Cross-Attention Fusion Network for Medical Image Segmentation

AM-AN: Adversarial Network Based on Attention Mechanism for Medical Image Segmentation

Multi-Model Medical Image Segmentation Using Multi-Stage Generative Adversarial Network

Self-Adaptive 2D-3D Ensemble of Fully Convolutional Networks for Medical Image Segmentation

A Generative Adversarial Network Fused with Dual-Attention Mechanism and Its Application in Multitarget Image Fine Segmentation

Automated Segmentation of the Optic Disk and Cup using Dual-Stage Fully Convolutional Networks

Generative Adversarial Networks for Pre-training of Medical Image Segmentation Networks

Scale-aware Super-resolution Network with Dual Affinity Learning for Lesion Segmentation from Medical Images

A medical image classification method based on self-regularized adversarial learning

Learning With Context Feedback Loop for Robust Medical Image Segmentation

Deep Learning-Based Image Segmentation on Multimodal Medical Imaging

ResGANet: Residual Group Attention Network for Medical Image Classification and Segmentation

SUSAN: Segment Unannotated image Structure using Adversarial Network

Cross-Modality Medical Image Segmentation via Enhanced Feature Alignment and Cross Pseudo Supervision Learning

DCACNet: Dual context aggregation and attention-guided cross deconvolution network for medical image segmentation

Deep Generative Adversarial Reinforcement Learning for Semi-Supervised Segmentation of Low-Contrast and Small Objects in Medical Images

A novel adversarial learning strategy for medical image classification

Annotation-Efficient Learning for Medical Image Segmentation Based on Noisy Pseudo Labels and Adversarial Learning