Abstract:

Background and objective: Due to the complexity of skin lesion features, computer-aided diagnosis of skin diseases based on multi-modal images is considered a challenging task. Dermoscopic images and clinical images are commonly used to diagnose skin diseases in clinical scenarios, and the complementarity of their features has motivated research on multi-modality classification in computer-aided diagnosis. Most current methods focus on fusion between modalities and ignore the complementary information within each of them, which leads to the loss of intra-modality relations. Multi-modality models that integrate features both within single modalities and across multiple modalities are scarce in the literature. Therefore, a multi-modality model based on dermoscopic and clinical images is proposed to address this issue.

Methods: We propose a Multi-scale Fully-shared Fusion Network (MFF-Net) that gathers features of dermoscopic images and clinical images for skin lesion classification. In MFF-Net, the multi-scale fusion structure combines deep and shallow features within individual modalities to reduce the loss of spatial information in high-level feature maps. The Dermo-Clinical Block (DCB) then integrates the feature maps from dermoscopic images and clinical images through channel-wise concatenation, using a fully-shared fusion strategy that explores complementary information at different stages.

Results: We validated our model on a four-class, two-modal skin disease dataset and showed that the proposed multi-scale structure, the DCB fusion module, and the fully-shared fusion strategy each independently improve the performance of MFF-Net. On the 7-point checklist dataset, our method achieved the highest average accuracy, 72.9%, outperforming state-of-the-art single-modality and multi-modality methods by 7.1% and 3.4%, respectively.

Conclusions: The multi-scale fusion structure demonstrates the significance of intra-modality relations within clinical images and dermoscopic images. The proposed network, which combines the multi-scale structure, DCBs, and the fully-shared fusion strategy, effectively integrates the features of skin lesions across the two modalities and achieves promising accuracy across different skin diseases.
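
The two fusion ideas named in the abstract, channel-wise concatenation of dermoscopic and clinical feature maps at several stages and the combination of shallow and deep features within one modality, can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names, the 1x1 channel-mixing step after concatenation, the pooling-based multi-scale descriptor, and all tensor shapes are assumptions chosen only to make the data flow concrete.

```python
import numpy as np


def concat_channels(derm, clin):
    """Channel-wise concatenation of two (C, H, W) feature maps."""
    if derm.shape[1:] != clin.shape[1:]:
        raise ValueError("spatial sizes of the two modalities must match")
    return np.concatenate([derm, clin], axis=0)


def pointwise_conv(fmap, weight):
    """1x1 convolution: mixes channels at every pixel. weight is (C_out, C_in)."""
    return np.einsum("oc,chw->ohw", weight, fmap)


def global_avg_pool(fmap):
    """Collapse a (C, H, W) feature map to a length-C vector."""
    return fmap.mean(axis=(1, 2))


def multi_scale_descriptor(fmaps):
    """Concatenate pooled shallow-to-deep maps (assumed aggregation scheme)."""
    return np.concatenate([global_avg_pool(f) for f in fmaps])


rng = np.random.default_rng(0)

# Hypothetical (channels, spatial size) of three backbone stages per modality.
stage_shapes = [(8, 16), (16, 8), (32, 4)]
derm_stages = [rng.standard_normal((c, s, s)) for c, s in stage_shapes]
clin_stages = [rng.standard_normal((c, s, s)) for c, s in stage_shapes]

# Fuse the two modalities at every stage, not only at the deepest one.
fused_stages = []
for derm, clin in zip(derm_stages, clin_stages):
    stacked = concat_channels(derm, clin)  # (2C, H, W)
    # Random weights stand in for learned parameters in this sketch.
    weight = rng.standard_normal((derm.shape[0], stacked.shape[0]))
    fused_stages.append(pointwise_conv(stacked, weight))  # (C, H, W)

# Shallow and deep features are kept together for each branch.
derm_vec = multi_scale_descriptor(derm_stages)
clin_vec = multi_scale_descriptor(clin_stages)
fused_vec = multi_scale_descriptor(fused_stages)
classifier_input = np.concatenate([derm_vec, clin_vec, fused_vec])
```

In this sketch each branch contributes a descriptor built from all three stages, so the shallow, spatially detailed maps reach the classifier alongside the deep ones. How MFF-Net actually shares and combines these features is specified in the paper itself, not here.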

Application of Multimodal Fusion Deep Learning Model in Disease Recognition

Integrating Medical Imaging and Clinical Reports Using Multimodal Deep Learning for Advanced Disease Analysis

Deep Multi-modal Fusion of Image and Non-image Data in Disease Diagnosis and Prognosis: A Review

Multimodal Medical Image Fusion: The Perspective of Deep Learning

Multimodal medical image fusion and classification using deep learning techniques

Integration of Multimodal Data for Breast Cancer Classification Using a Hybrid Deep Learning Method

Multimodal medical image fusion using convolutional neural network and extreme learning machine

Multimodal fusion with deep neural networks for leveraging CT imaging and electronic health record: a case-study in pulmonary embolism detection

A review of deep learning-based information fusion techniques for multimodal medical image classification

Deep learning and multimodal feature fusion for the aided diagnosis of Alzheimer's disease

Multimodal Fusion Learning with Dual Attention for Medical Imaging

Missing-modality Enabled Multi-modal Fusion Architecture for Medical Data

Deep Learning Based Multimodal Biomedical Data Fusion: an Overview and Comparative Review

Transformer-Based Multi-Modal Data Fusion Method for COPD Classification and Physiological and Biochemical Indicators Identification

MFISN: Modality Fuzzy Information Separation Network for Disease Classification

Multi-modal medical image fusion based on densely-connected high-resolution CNN and hybrid transformer

Skin Lesion classification based on two-modal images using a multi-scale fully-shared fusion network

Richer fusion network for breast cancer classification based on multimodal data

MultiFusionNet: Multilayer Multimodal Fusion of Deep Neural Networks for Chest X-Ray Image Classification

CIRF: Coupled Image Reconstruction and Fusion Strategy for Deep Learning Based Multi-Modal Image Fusion

Multimodal Medical Imaging Using Modern Deep Learning Approaches