Abstract:Tianle Chen, Qi Liu, Jie Yang College of Biomedical Engineering, Sichuan University, Chengdu, 610065, People's Republic of China Correspondence: Qi Liu, College of Biomedical Engineering, Sichuan University, Chengdu, 610065, People's Republic of China, Email Introduction: Skin disease is one of the most common diseases and can affect people of all ages and races. However, the diagnosis of skin diseases via observation is a highly challenging task for both doctors and patients, and would benefit from the use of an intelligent system. Building a large benchmark with professional dermatologists is resource-intensive, and we believe that few-shot learning (FSL) methods would be helpful in solving the problem of annotated data scarcity. In this paper, we propose CDD-Net (Context Feature Fusion and Dual Attention Dermatology Net), a plug-in module for FSL clinical skin disease classification. Methods: Current FSL methods used in skin disease classification are limited to nonuniversal approaches and few disease classes. Our CDD-Net has a flexible structure, including a context feature–fusion module and dual-attention module to extract discriminating texture feature and emphasize contributive regions and channels. The context feature–fusion module localizes discriminatory texture details of skin lesions by integrating features from different layers, while the dual-attention module highlights discriminative regions via channel-wise and pixel-wise depictions based on weight vectors and restrains the contributions of irrelevant areas. We also present Derm104, a new clinical skin disease data benchmark that has significant coverage of rare diseases and reliable annotation between primary species and subspecies for better validation of our approach. Results: Our experiments validated the versatility of CDD-Net for different FSL methods and achieved an improvement in accuracy of up to 9.14 percentage points compared with the vanilla network, which can be considered state of the art. The ablation study also showed that the dual-attention module and context feature–fusion module worked efficiently in CDD-Net. Keywords: clinical skin image, computer-aided diagnosis, few-shot learning, feature fusion, attention mechanism Graphical Skin disease is one of the most common diseases and can affect people of all ages and races. 1 There are many problems that skin diseases can cause to patients, including itching and bleeding, which can seriously affect their quality of life or even cause them to lose their lives. Skin diseases should be diagnosed early so that they can receive the correct treatment as soon as possible and avoid further progression. Delayed diagnosis can be attributed to the limited medical knowledge of patients and disparity in medical resources. We can solve some of these problems with the help of computer-aided diagnosis to a certain extent. Dermoscopic images are the primary focus of early investigations using computer-aided diagnosis for skin diseases. 2,3 This is due to the fact that they focus more on lesions than clinical images with uniform illumination and less noise. However, dermoscopic-based diagnosis of skin diseases has some restrictions, such as high costs and low convenience. In recent years, some researchers have begun to pay more attention to clinical images. 4, 5 Differences between clinical images and other images are demonstrated in Figure 1. Figure 1 Comparison of clinical skin images ( a – c ), dermoscopic images ( d – f ), and tissue-biopsy pathology images ( g – i ). The gold standard for skin disease diagnosis is based on tissue-biopsy pathology images, but high-quality equipment and testing techniques are required. Dermoscopic images are obtained using a microscope, which can only test lesions in block or dot shape. Clinical images are influenced by the angle of photography and the intensity of light, and are almost always obtained under different lighting conditions and uneven focal lengths of the lesion, resulting in greater external interference. In recent years, the fields of feature learning and object recognition have experienced tremendous growth in the use of convolution neural networks (CNNs). According to numerous studies from ImageNet's large-scale visual recognition challenge, the most sophisticated CNN has outperformed humans on object-classification tasks. 6–8 Due to its exceptional performance over traditional methods, deep CNN-based learning is also frequently utilized in skin disease classification, 9,10 lesion localization, and segmentation tasks, 11–14 A majority of these tasks showed a high standard of accura -Abstract Truncated-

Pay Less On Clinical Images: Asymmetric Multi-Modal Fusion Method For Efficient Multi-Label Skin Lesion Classification

Single-Shared Network with Prior-Inspired Loss for Parameter-Efficient Multi-Modal Imaging Skin Lesion Classification

Skin Lesion classification based on two-modal images using a multi-scale fully-shared fusion network

Joint-individual fusion structure with fusion attention module for multi-modal skin cancer classification

A Novel Perspective for Multi-modal Multi-label Skin Lesion Classification

MSMA: A multi-stage and multi-attention algorithm for the classification of multimodal skin lesions

CS-AF: A Cost-sensitive Multi-classifier Active Fusion Framework for Skin Lesion Classification

Transformer-based interpretable multi-modal data fusion for skin lesion classification

Rectus sparing approach to left ventricular assist device exchange and use of the omental flap for coverage.

Optimizing Skin Lesion Classification via Multimodal Data and Auxiliary Task Integration

MASDF-Net: A Multi-Attention Codec Network with Selective and Dynamic Fusion for Skin Lesion Segmentation

Bi-directional Dermoscopic Feature Learning and Multi-scale Consistent Decision Fusion for Skin Lesion Segmentation

Self-Supervised Multi-Modality Learning for Multi-Label Skin Lesion Classification

RemixFormer++: A Multi-modal Transformer Model for Precision Skin Tumor Differential Diagnosis with Memory-efficient Attention

A Novel Transfer Learning Framework for Multimodal Skin Lesion Analysis

Few-Shot Classification with Multiscale Feature Fusion for Clinical Skin Disease Diagnosis

MLFF-Net: a multi-model late feature fusion network for skin disease classification

MDFNet: application of multimodal fusion method based on skin image and clinical data to skin cancer classification

Graph-Ensemble Learning Model for Multi-label Skin Lesion Classification using Dermoscopy and Clinical Images

[Molecular profiling of non-small cell lung cancer].