Abstract:Purpose: Computer-aided diagnosis (CAD) can aid in improving diagnostic level; however, the main problem currently faced by CAD is that it cannot obtain sufficient labeled samples. To solve this problem, in this study, we adopt a generative adversarial network (GAN) approach and design a semisupervised learning algorithm, named G2C-CAD. Methods: From the National Cancer Institute (NCI) Lung Image Database Consortium (LIDC) dataset, we extracted four types of pulmonary nodule sign images closely related to lung cancer: noncentral calcification, lobulation, spiculation, and nonsolid/ground-glass opacity (GGO) texture, obtaining a total of 3,196 samples. In addition, we randomly selected 2,000 non-lesion image blocks as negative samples. We split the data 90% for training and 10% for testing. We designed a DCGAN generative adversarial framework and trained it on the small sample set. We also trained our designed CNN-based fuzzy Co-forest on the labeled small sample set and obtained a preliminary classifier. Then, coupled with the simulated unlabeled samples generated by the trained DCGAN, we conducted iterative semisupervised learning, which continually improved the classification performance of the fuzzy Co-forest until the termination condition was reached. Finally, we tested the fuzzy Co-forest and compared its performance with that of a C4.5 random decision forest and the G2C-CAD system without the fuzzy scheme, using ROC and confusion matrix for evaluation. Results: Four different types of lung cancer-related signs were used in the classification experiment: noncentral calcification, lobulation, spiculation, and nonsolid/ground-glass opacity (GGO) texture, along with negative image samples. For these five classes, the G2C-CAD system obtained AUCs of 0.946, 0.912, 0.908, 0.887, and 0.939, respectively. The average accuracy of G2C-CAD exceeded that of the C4.5 random decision tree by 14%. G2C-CAD also obtained promising test results on the LISS signs dataset; its AUCs for GGO, lobulation, spiculation, pleural indentation, and negative image samples were 0.972, 0.964, 0.941, 0.967, and 0.953, respectively. Conclusion: The experimental results show that G2C-CAD is an appropriate method for addressing the problem of insufficient labeled samples in the medical image analysis field. Moreover, our system can be used to establish a training sample library for CAD classification diagnosis, which is important for future medical image analysis.

Improve Computer-Aided Diagnosis with Machine Learning Techniques Using Undiagnosed Samples

Improve Computer-Aided Diagnosis with Machine

Coarse-to-Fine Classification via Parametric and Nonparametric Models for Computer-Aided Diagnosis

A Novel Computer-Aided Diagnosis Scheme on Small Annotated Set: G2C-CAD

Fusing Medical Image Features and Clinical Features with Deep Learning for Computer-Aided Diagnosis

Computer-Aided Diagnosis with Deep Learning Architecture: Applications to Breast Lesions in US Images and Pulmonary Nodules in CT Scans

Learning algorithm with non-balanced data for computer-aided diagnosis of breast cancer

Improving Computer-aided Detection using Convolutional Neural Networks and Random View Aggregation

A Hybrid Deep Learning Approach to Predict Malignancy of Breast Lesions Using Mammograms

Interpretative Computer-aided Lung Cancer Diagnosis: from Radiology Analysis to Malignancy Evaluation

Computer-Aided Diagnosis (CAD) of Pulmonary Nodule of Thoracic CT Image Using Transfer Learning

A Computer-Aided Diagnosis System for Breast Pathology: A Deep Learning Approach with Model Interpretability from Pathological Perspective

Computer-aided diagnostic system based on deep learning for classifying colposcopy images

Colorectal cancer diagnosis from histology images: A comparative study

Computer-Aided Dementia Diagnosis Based on Hierarchical Extreme Learning Machine

Collaborative Unsupervised Domain Adaptation for Medical Image Diagnosis

Computer-aided diagnosis in medical imaging: Historical review, current status and future potential

Automatic feature learning using multichannel ROI based on deep structured algorithms for computerized lung cancer diagnosis

Weakly Supervised Lesion Detection and Diagnosis for Breast Cancers with Partially Annotated Ultrasound Images

Recent advancement in Disease Diagnostic using machine learning: Systematic survey of decades, comparisons, and challenges

Active Semi-Supervised Learning via Bayesian Experimental Design for Lung Cancer Classification Using Low Dose Computed Tomography Scans