Abstract:Deep learning-based computer-aided diagnosis techniques have demonstrated encouraging performance in endoscopic lesion identification and detection, and have reduced the rate of missed and false detections of disease during endoscopy. However, the interpretability of the model-based results has not been adequately addressed by existing methods. This phenomenon is directly manifested by a significant bias in the representation of feature localization. Good recognition models experience severe feature localization errors, particularly for lesions with subtle morphological features, and such unsatisfactory performance hinders the clinical deployment of models. To effectively alleviate this problem, we proposed a solution to optimize the localization bias in feature representations of cancer-related recognition models that is difficult to accurately label and identify in clinical practice. Optimization was performed in the training phase of the model through the proposed data augmentation method and auxiliary loss function based on clinical priors. The data augmentation method, called partial jigsaw, can “break” the spatial structure of lesion-independent image blocks and enrich the data feature space to decouple the interference of background features on the space and focus on fine-grained lesion features. The annotation-based auxiliary loss function used class activation maps for sample distribution correction and led the model to present localization representation converging on the gold standard annotation of visualization maps. The results show that with the improvement of our method, the precision of model recognition reached an average of 92.79%, an F1-score of 92.61%, and accuracy of 95.56% based on a dataset constructed from 23 hospitals. In addition, we quantified the evaluation representation of visualization feature maps. The improved model yielded significant offset correction results for visualized feature maps compared with the baseline model. The average visualization-weighted positive coverage improved from 51.85% to 83.76%. The proposed approach did not change the deployment capability and inference speed of the original model and can be incorporated into any state-of-the-art neural network. It also shows the potential to provide more accurate localization inference results and assist in clinical examinations during endoscopies.

Generalizable Feature Learning in the Presence of Data Bias and Domain Class Imbalance with Application to Skin Lesion Classification

IRLSG: Invariant Representation Learning for Single-Domain Generalization in Medical Image Segmentation

Artifact-Based Domain Generalization of Skin Lesion Models

Multi-label Recognition of Cancer-Related Lesions with Clinical Priors on White-Light Endoscopy

Enhancing the Generalization Capability of Skin Lesion Classification Models with Active Domain Adaptation Methods

Consistent representation via contrastive learning for skin lesion diagnosis

Single Model Deep Learning on Imbalanced Small Datasets for Skin Lesion Classification

CIRCLe: Color Invariant Representation Learning for Unbiased Classification of Skin Lesions

Achieve Fairness without Demographics for Dermatological Disease Diagnosis

Domain Generalization for Medical Imaging Classification with Linear-Dependency Regularization

Mitigating the Influence of Domain Shift in Skin Lesion Classification: A Benchmark Study of Unsupervised Domain Adaptation Methods on Dermoscopic Images

Achieving Reliable and Fair Skin Lesion Diagnosis via Unsupervised Domain Adaptation

Rescuing referral failures during automated diagnosis of domain-shifted medical images

Universal Medical Imaging Model for Domain Generalization with Data Privacy

Adversarial Training Based Domain Adaptation of Skin Cancer Images

Domain shifts in dermoscopic skin cancer datasets: Evaluation of essential limitations for clinical translation

Privacy-Preserving Constrained Domain Generalization via Gradient Alignment

Understanding skin color bias in deep learning-based skin lesion segmentation

Automatic Skin Lesion Segmentation Using Deep Fully Convolutional Networks With Jaccard Distance

An Efficient Deep Learning-Based Skin Cancer Classifier for an Imbalanced Dataset

Toward Generalizable Multiple Sclerosis Lesion Segmentation Models