Abstract:Abstract The detailed physiological perspectives captured by medical imaging provides actionable insights to doctors to manage comprehensive care of patients. However, the quality of such diagnostic image modalities is often affected by mismanagement of the image capturing process by poorly trained technicians and older/poorly maintained imaging equipment. Further, a patient is often subjected to scanning at different orientations to capture the frontal, lateral and sagittal views of the affected areas. Due to the large volume of diagnostic scans performed at a modern hospital, adequate documentation of such additional perspectives is mostly overlooked, which is also an essential key element of quality diagnostic systems and predictive analytics systems. Another crucial challenge affecting effective medical image data management is that the diagnostic scans are essentially stored as unstructured data, lacking a well-defined processing methodology for enabling intelligent image data management for supporting applications like similar patient retrieval , automated disease prediction etc. One solution is to incorporate automated diagnostic image descriptions of the observation/findings by leveraging computer vision and natural language processing. In this work, we present multi-task neural models capable of addressing these critical challenges. We propose ESRGAN, an image enhancement technique for improving the quality and visualization of medical chest x-ray images, thereby substantially improving the potential for accurate diagnosis, automatic detection and region-of-interest segmentation. We also propose a CNN-based model called ViewNet for predicting the view orientation of the x-ray image and generating a medical report using Xception net, thus facilitating a robust medical image management system for intelligent diagnosis applications. Experimental results are demonstrated using standard metrics like BRISQUE, PIQE and BLEU scores, indicating that the proposed models achieved excellent performance. Further, the proposed deep learning approaches enable diagnosis in a lesser time and their hybrid architecture shows significant potential for supporting many intelligent diagnosis applications.

Fostering transparent medical image AI via an image-text foundation model grounded in medical literature

Transparent medical image AI via an image–text foundation model grounded in medical literature

Mask-Free Neuron Concept Annotation for Interpreting Neural Networks in Medical Domain

MedImageInsight: An Open-Source Embedding Model for General Domain Medical Imaging

Deep neural models for automated multi-task diagnostic scan management—quality enhancement, view classification and report generation

VisionCLIP: An Med-AIGC based Ethical Language-Image Foundation Model for Generalizable Retina Image Analysis

Auditing the inference processes of medical-image classifiers by leveraging generative AI and the expertise of physicians

A Framework for Evaluating the Efficacy of Foundation Embedding Models in Healthcare

MINT: A wrapper to make multi-modal and multi-image AI models interactive

Enhancing Skin Disease Diagnosis: Interpretable Visual Concept Discovery with SAM Empowerment

MONAI: An open-source framework for deep learning in healthcare

Explainable AI for Medical Image Analysis in Medical Cyber-Physical Systems: Enhancing Transparency and Trustworthiness of IoMT

Concept-Attention Whitening for Interpretable Skin Lesion Diagnosis

Towards Scalable Foundation Models for Digital Dermatology

MONAI Label: A framework for AI-assisted interactive labeling of 3D medical images

MICA: Towards Explainable Skin Lesion Diagnosis via Multi-Level Image-Concept Alignment

Leveraging Computational Pathology AI for Noninvasive Optical Imaging Analysis Without Retraining

Toward Transparent AI for Neurological Disorders: A Feature Extraction and Relevance Analysis Framework

Robust and Interpretable Medical Image Classifiers via Concept Bottleneck Models

Foundation AI Model for Medical Image Segmentation

Debiased Noise Editing on Foundation Models for Fair Medical Image Classification