Abstract:Currently, the development of deep learning-based multimodal learning is advancing rapidly, and is widely used in the field of artificial intelligence-generated content, such as image-text conversion and image-text generation. Electronic health records are digital information such as numbers, charts, and texts generated by medical staff using information systems in the process of medical activities. The multimodal fusion method of electronic health records based on deep learning can assist medical staff in the medical field to comprehensively analyze a large number of medical multimodal data generated in the process of diagnosis and treatment, thereby achieving accurate diagnosis and timely intervention for patients. In this article, we firstly introduce the methods and development trends of deep learning-based multimodal data fusion. Secondly, we summarize and compare the fusion of structured electronic medical records with other medical data such as images and texts, focusing on the clinical application types, sample sizes, and the fusion methods involved in the research. Through the analysis and summary of the literature, the deep learning methods for fusion of different medical modal data are as follows: first, selecting the appropriate pre-trained model according to the data modality for feature representation and post-fusion, and secondly, fusing based on the attention mechanism. Lastly, the difficulties encountered in multimodal medical data fusion and its developmental directions, including modeling methods, evaluation and application of models, are discussed. Through this review article, we expect to provide reference information for the establishment of models that can comprehensively utilize various modal medical data.

A review: Deep learning for medical image segmentation using multi-modality fusion

Deep learning methods for medical image fusion: A review

A review of deep learning-based information fusion techniques for multimodal medical image classification

Deep Learning-Based Image Segmentation on Multimodal Medical Imaging

A Review of Multimodal Medical Image Fusion Techniques

Multimodal Medical Image Fusion: The Perspective of Deep Learning

Medical Image Segmentation Based on Multi-Modal Convolutional Neural Network: Study on Image Fusion Schemes

Deep Multi-modal Fusion of Image and Non-image Data in Disease Diagnosis and Prognosis: A Review

A review of deep learning approaches for multimodal image segmentation of liver cancer

Medical image segmentation based on self-supervised hybrid fusion network

Polyamines and plant disease.

Multimodal deep learning for biomedical data fusion: a review

A Review of Deep-Learning-Based Medical Image Segmentation Methods

A Review of the Application of Multi-modal Deep Learning in Medicine: Bibliometrics and Future Directions

Advances in Medical Image Segmentation: A Comprehensive Review of Traditional, Deep Learning and Hybrid Approaches

Multimodal medical image fusion review: Theoretical background and recent advances

Deep Learning Based Multimodal Biomedical Data Fusion: an Overview and Comparative Review

[Research progress on electronic health records multimodal data fusion based on deep learning]

Deep Fusion of Shifted MLP and CNN for Medical Image Segmentation.

Multimodal Medical Imaging Using Modern Deep Learning Approaches

Multimodal medical image fusion and classification using deep learning techniques