Multi-modal Medical Data Fusion using Deep Learning

B. Sandhya,D. Haritha
DOI: https://doi.org/10.23919/INDIACom54597.2022.9763296
2022-03-23
Abstract:Various modalities, each with a characteristic feature, have evolved over the years in the medical domain to aid the identification and diagnosis of the diseases. The focus of research in multi modal data analysis in medical domain has been to fuse complimentary information, primarily from images, addressing the great variability arising due to differences in method and time of acquisition. Most of the multi modal image analysis approaches using deep learning are CNN based, using convolution layers for image representation and hence employing early fusion strategy. Additional layers of the deep network are designed depending on the type of input images such as 2D or 3D, X-ray, MRI etc., type of deformations between images, and the target application. In practice, medical images are interpreted in the appropriate clinical context using other relevant data such as patient history and laboratory data. Coalescing information from multiple modalities such as images, structured laboratory data, unstructured narrative data can aid accurate diagnosis and efficient monitoring during treatment. Hence multi modal deep learning models can be used to ingest pixel data along with other structured and textual data to overcome the limitation of image-only models. The proposed work involves detecting type of skin lesion using images and patient clinical data. Experimental results on PAD-UFES-20 dataset verify that combining patient information and image features results in increase of accuracy of detection.
Medicine,Computer Science
What problem does this paper attempt to address?