Abstract:
Background: Data collected from hospitals are usually only partially annotated by radiologists because of time constraints. Developing and evaluating deep learning models on these data may result in over- or underestimation of model performance.
Purpose: We aimed to quantitatively investigate how the percentage of annotated lesions in CT images influences the performance of universal lesion detection (ULD) algorithms.
Methods: We trained a multi-view feature pyramid network with position-aware attention (MVP-Net) to perform ULD. Three versions of the DeepLesion dataset were created for training MVP-Net. The Original DeepLesion Dataset (OriginalDL) is the publicly available, widely studied DeepLesion dataset, which includes 32 735 lesions in 4427 patients that were partially labeled during routine clinical practice. The Enriched DeepLesion Dataset (EnrichedDL) is an enhanced dataset in which 4145 patients, with 34 317 lesions, are fully labeled at one or more time points. UnionDL is the union of OriginalDL and EnrichedDL, with 54 510 labeled lesions in 4427 patients. Each dataset was used separately to train MVP-Net, resulting in the following models: OriginalCNN (replicating the original result), EnrichedCNN (testing the effect of increased annotation), and UnionCNN (featuring the greatest number of annotations).
Results: Although the reported mean sensitivity of OriginalCNN was 84.3% on the OriginalDL testing set, performance fell sharply on the EnrichedDL testing set, where OriginalCNN, EnrichedCNN, and UnionCNN yielded mean sensitivities of 56.1%, 66.0%, and 67.8%, respectively. We also found that increasing the percentage of annotated lesions in the training set increased sensitivity, but the margin of improvement gradually diminished, following a power law.
Conclusions: We expanded and improved the existing DeepLesion dataset by annotating an additional 21 775 lesions, and we demonstrated that using fully labeled CT images avoided overestimation of MVP-Net's performance while increasing the algorithm's sensitivity, which may have a substantial impact on future CT lesion detection research. The annotated lesions are available at https://github.com/ComputationalImageAnalysisLab/DeepLesionData.
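
To make the evaluation issue concrete, the following is a minimal Python sketch of how the same set of detections can score differently against a partially annotated and a fully annotated test set. It is not the authors' evaluation code: the function names, the box coordinates, and the IoU threshold of 0.5 are all assumptions chosen for illustration, and the toy numbers are not taken from the study.

```python
from typing import List, Tuple

# A box is (x1, y1, x2, y2) in pixel coordinates (hypothetical convention).
Box = Tuple[float, float, float, float]


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def sensitivity(detections: List[Box], annotations: List[Box],
                iou_thresh: float = 0.5) -> float:
    """Fraction of annotated lesions matched by at least one detection.

    The threshold of 0.5 is an assumption made for this sketch.
    """
    if not annotations:
        return 0.0
    hits = sum(
        any(iou(det, gt) >= iou_thresh for det in detections)
        for gt in annotations
    )
    return hits / len(annotations)


# Toy data, invented for illustration only.
detections = [(10, 10, 30, 30), (50, 50, 80, 80)]

# Partial annotation: only two lesions were marked.
partial_gt = [(11, 11, 31, 31), (50, 50, 78, 78)]

# Full annotation: the same two lesions plus two that were left unmarked.
full_gt = partial_gt + [(100, 100, 110, 110), (200, 40, 215, 60)]

print("sensitivity vs. partial annotations:", sensitivity(detections, partial_gt))
print("sensitivity vs. full annotations:   ", sensitivity(detections, full_gt))
```

On this toy data the detector finds every lesion in the partial annotation set (sensitivity 1.0) but only half of the lesions in the full set (sensitivity 0.5). The direction of the gap mirrors the overestimation described in the abstract, although the magnitude here is arbitrary.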

Annotation quality vs. quantity for deep-learned medical image segmentation

Crowdsourcing image segmentation for deep learning: integrated platform for citizen science, paid microtask, and gamification

Expert-Level Annotation Quality Achieved by Gamified Crowdsourcing for B-line Segmentation in Lung Ultrasound

Anytime, Anywhere, Anyone: Investigating the Feasibility of Segment Anything Model for Crowd-Sourcing Medical Image Annotations

How to Efficiently Annotate Images for Best-Performing Deep Learning Based Segmentation Models: An Empirical Study with Weak and Noisy Annotations and Segment Anything Model

Gamification concept for acquisition of medical image segmentation via crowdsourcing

Reliable Mutual Distillation for Medical Image Segmentation under Imperfect Annotations

Coupling AI and Citizen Science in Creation of Enhanced Training Dataset for Medical Image Segmentation

A sparse annotation strategy based on attention-guided active learning for 3D medical image segmentation

Technical note: The effect of image annotation with minimal manual interaction for semiautomatic prostate segmentation in CT images using fully convolutional neural networks

Annotation Cost Minimization for Ultrasound Image Segmentation Using Cross-domain Transfer Learning

Robustness study of noisy annotation in deep learning based medical image segmentation

Embracing imperfect datasets: A review of deep learning solutions for medical image segmentation

A quantitative analysis of the improvement provided by comprehensive annotation on CT lesion detection using deep learning

SUSAN: Segment Unannotated image Structure using Adversarial Network

How to select slices for annotation to train best-performing deep learning segmentation models for cross-sectional medical images?

Progressive Medical Image Annotation with Convolutional Neural Network-Based Interactive Segmentation Method

Efforts estimation of doctors annotating medical image

Guidelines for Cerebrovascular Segmentation: Managing Imperfect Annotations in the context of Semi-Supervised Learning

Quality Sentinel: Estimating Label Quality and Errors in Medical Segmentation Datasets

Annotation-Efficient Learning for Medical Image Segmentation Based on Noisy Pseudo Labels and Adversarial Learning