Abstract: Universal lesion detection (ULD) has great value for clinical practice, as it aims to detect various types of lesions in multiple organs on medical images. Deep learning methods have shown promising results but demand large volumes of annotated training data. However, annotating medical images is costly and requires specialized knowledge. The diverse forms and contrasts of objects in medical images make full annotation even more challenging, resulting in incomplete annotations. Directly training ULD detectors on such datasets can yield suboptimal results. Pseudo-label-based methods examine the training data and mine unlabelled objects for retraining, and have been shown to be effective in tackling this issue. Presently, top-performing methods rely on a dynamic label-mining mechanism that operates at the mini-batch level. However, the model's performance varies across iterations, which leads to inconsistent quality in the mined labels and limits the resulting performance gains. Inspired by the observation that deep models learn concepts of increasing complexity, we introduce an exploratory training scheme that assesses the reliability of mined lesions over time. Specifically, we use a teacher-student detection model as the basis, where the teacher's predictions are combined with the incomplete annotations to train the student. Additionally, we design a prediction bank to record high-confidence predictions. Each sample is trained on several times, which yields a sequence of records for each sample. If a prediction consistently appears in the record sequence, it is likely to be a true object; otherwise, it may just be noise. This serves as a crucial criterion for selecting reliable mined lesions for retraining. Experimental results show that the proposed framework surpasses state-of-the-art methods on two medical image datasets.
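The prediction-bank criterion described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the class name `PredictionBank`, the IoU-based matching rule, and the thresholds `score_thresh`, `iou_thresh`, and `min_ratio` are all assumptions chosen for illustration. The abstract states only that high-confidence predictions are recorded each time a sample is trained and that predictions appearing consistently in the record sequence are treated as reliable.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Tuple

Box = Tuple[float, float, float, float]  # (x1, y1, x2, y2)


def iou(a: Box, b: Box) -> float:
    """Intersection-over-union of two axis-aligned boxes."""
    iw = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    ih = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = iw * ih
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


@dataclass
class PredictionBank:
    # All thresholds are hypothetical values, not taken from the paper.
    score_thresh: float = 0.8   # what counts as "high-confidence"
    iou_thresh: float = 0.5     # when two boxes count as the same object
    min_ratio: float = 0.75     # fraction of records a box must appear in
    records: Dict[str, List[List[Box]]] = field(default_factory=dict)

    def record(self, sample_id: str, boxes: List[Box], scores: List[float]) -> None:
        """Store the high-confidence predictions from one training pass."""
        kept = [b for b, s in zip(boxes, scores) if s >= self.score_thresh]
        self.records.setdefault(sample_id, []).append(kept)

    def reliable(self, sample_id: str) -> List[Box]:
        """Return latest-pass boxes that recur across the record sequence."""
        history = self.records.get(sample_id, [])
        if not history:
            return []
        selected = []
        for cand in history[-1]:
            # Count the passes in which some recorded box matches the candidate.
            hits = sum(
                any(iou(cand, b) >= self.iou_thresh for b in rec)
                for rec in history
            )
            if hits / len(history) >= self.min_ratio:
                selected.append(cand)
        return selected


bank = PredictionBank()
# Four passes over the same (hypothetical) sample "s1".
bank.record("s1", [(10, 10, 50, 50)], [0.90])
bank.record("s1", [(12, 11, 51, 49)], [0.85])
bank.record("s1", [(11, 10, 50, 51), (200, 200, 220, 220)], [0.92, 0.30])
bank.record("s1", [(10, 11, 49, 50), (100, 100, 120, 120)], [0.95, 0.88])
stable = bank.reliable("s1")
```

In this sketch, the box near `(10, 10, 50, 50)` recurs in every pass and is kept, while the box at `(100, 100, 120, 120)` appears only once and is treated as likely noise. In a teacher-student setup, the selected boxes would then be merged with the incomplete annotations for retraining.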

How Many Annotations Do We Need for Generalizing New-Coming Shadow Images?

Annotate less but perform better: weakly supervised shadow detection via label augmentation

Attention Res-Unet: an Efficient Shadow Detection Algorithm

Boosting sparsely annotated shadow detection

Exploring better sparsely annotated shadow detection

ADeLA: Automatic Dense Labeling with Attention for Viewpoint Shift in Semantic Segmentation

Exploring Better Target for Shadow Detection

Light-weight shadow detection via GCN-based annotation strategy and knowledge distillation

Augment and Criticize: Exploring Informative Samples for Semi-Supervised Monocular 3D Object Detection

How to Efficiently Annotate Images for Best-Performing Deep Learning Based Segmentation Models: An Empirical Study with Weak and Noisy Annotations and Segment Anything Model

Semantic-aware Transformer for shadow detection

Are Dense Labels Always Necessary for 3D Object Detection from Point Cloud?

Improving the Generalization of Segmentation Foundation Model under Distribution Shift via Weakly Supervised Adaptation

Pre-Trained Vision-Language Models as Partial Annotators

Hard-aware Instance Adaptive Self-training for Unsupervised Cross-domain Semantic Segmentation

Domain Adaptive and Generalizable Network Architectures and Training Strategies for Semantic Image Segmentation

Diff-Shadow: Global-guided Diffusion Model for Shadow Removal

SILT: Shadow-aware Iterative Label Tuning for Learning to Detect Shadows from Noisy Labels

Multiview Detection with Shadow Transformer (and View-Coherent Data Augmentation)

SemHint-MD: Learning from Noisy Semantic Labels for Self-Supervised Monocular Depth Estimation

Tackling the Incomplete Annotation Issue in Universal Lesion Detection Task By Exploratory Training