Few-Shot Classification with Multiscale Feature Fusion for Clinical Skin Disease Diagnosis
Tianle Chen,Qi Liu,Jie Yang
DOI: https://doi.org/10.2147/ccid.s458255
2024-05-07
Clinical Cosmetic and Investigational Dermatology
Abstract:Tianle Chen, Qi Liu, Jie Yang College of Biomedical Engineering, Sichuan University, Chengdu, 610065, People's Republic of China Correspondence: Qi Liu, College of Biomedical Engineering, Sichuan University, Chengdu, 610065, People's Republic of China, Email Introduction: Skin disease is one of the most common diseases and can affect people of all ages and races. However, the diagnosis of skin diseases via observation is a highly challenging task for both doctors and patients, and would benefit from the use of an intelligent system. Building a large benchmark with professional dermatologists is resource-intensive, and we believe that few-shot learning (FSL) methods would be helpful in solving the problem of annotated data scarcity. In this paper, we propose CDD-Net (Context Feature Fusion and Dual Attention Dermatology Net), a plug-in module for FSL clinical skin disease classification. Methods: Current FSL methods used in skin disease classification are limited to nonuniversal approaches and few disease classes. Our CDD-Net has a flexible structure, including a context feature–fusion module and dual-attention module to extract discriminating texture feature and emphasize contributive regions and channels. The context feature–fusion module localizes discriminatory texture details of skin lesions by integrating features from different layers, while the dual-attention module highlights discriminative regions via channel-wise and pixel-wise depictions based on weight vectors and restrains the contributions of irrelevant areas. We also present Derm104, a new clinical skin disease data benchmark that has significant coverage of rare diseases and reliable annotation between primary species and subspecies for better validation of our approach. Results: Our experiments validated the versatility of CDD-Net for different FSL methods and achieved an improvement in accuracy of up to 9.14 percentage points compared with the vanilla network, which can be considered state of the art. The ablation study also showed that the dual-attention module and context feature–fusion module worked efficiently in CDD-Net. Keywords: clinical skin image, computer-aided diagnosis, few-shot learning, feature fusion, attention mechanism Graphical Skin disease is one of the most common diseases and can affect people of all ages and races. 1 There are many problems that skin diseases can cause to patients, including itching and bleeding, which can seriously affect their quality of life or even cause them to lose their lives. Skin diseases should be diagnosed early so that they can receive the correct treatment as soon as possible and avoid further progression. Delayed diagnosis can be attributed to the limited medical knowledge of patients and disparity in medical resources. We can solve some of these problems with the help of computer-aided diagnosis to a certain extent. Dermoscopic images are the primary focus of early investigations using computer-aided diagnosis for skin diseases. 2,3 This is due to the fact that they focus more on lesions than clinical images with uniform illumination and less noise. However, dermoscopic-based diagnosis of skin diseases has some restrictions, such as high costs and low convenience. In recent years, some researchers have begun to pay more attention to clinical images. 4, 5 Differences between clinical images and other images are demonstrated in Figure 1. Figure 1 Comparison of clinical skin images ( a – c ), dermoscopic images ( d – f ), and tissue-biopsy pathology images ( g – i ). The gold standard for skin disease diagnosis is based on tissue-biopsy pathology images, but high-quality equipment and testing techniques are required. Dermoscopic images are obtained using a microscope, which can only test lesions in block or dot shape. Clinical images are influenced by the angle of photography and the intensity of light, and are almost always obtained under different lighting conditions and uneven focal lengths of the lesion, resulting in greater external interference. In recent years, the fields of feature learning and object recognition have experienced tremendous growth in the use of convolution neural networks (CNNs). According to numerous studies from ImageNet's large-scale visual recognition challenge, the most sophisticated CNN has outperformed humans on object-classification tasks. 6–8 Due to its exceptional performance over traditional methods, deep CNN-based learning is also frequently utilized in skin disease classification, 9,10 lesion localization, and segmentation tasks, 11–14 A majority of these tasks showed a high standard of accura -Abstract Truncated-
dermatology