Abstract:Background and objective: Medical imaging techniques are widely employed in disease diagnosis and treatment. A readily available medical report can be a useful tool in assisting an expert for investigating the patient's health. A radiologist can benefit from an automatic medical image to radiological report translation system while preparing a final report. Previous attempts on automatic medical report generation task includes image captioning algorithms without taking domain-specific visual and textual contents into account, thus arises the question about credibility of generated report. Methods: In this work, a novel Adaptive Multilevel Multi-Attention (AMLMA) approach is proposed by offering domain-specific visual-textual knowledge to generate a thorough and believable radiological report for any view of a human chest X-ray image. The proposed approach leverages the encoder-decoder framework incorporated with multiple adaptive attention mechanisms. The potential of a convolutional neural network (CNN) with residual attention module (RAM) is demonstrated as a strong visual encoder for multi-label abnormality detection. The multilevel visual features (local and global) are extracted from proposed visual encoder to retrieve regional-level and abstract-level radiology-based semantic information. The Word2Vec and FastText word embeddings are trained on medical reports to acquire radiological knowledge and further used as textual encoders, feeding as input to Bi-directional Long Short Term Memory (Bi-LSTM) network to learn the co-relationship between medical terminologies in radiological reports. The AMLMA employs a weighted multilevel association of adaptive visual-semantic attention and visual-based linguistic attention mechanisms. This association of adaptive attention is exploited as a decoder and produces significant improvements in the report generation task. Results: The proposed approach is evaluated on a publicly available Indiana University chest X-ray (IU-CXR) dataset. The CNN with RAM shows the significant improvement in recall (0.4423), precision (0.1803) and F1-score (0.2551) for prediction of multiple abnormalities in X-ray image. The results of language generation metrics for proposed variants were acquired using the COCO-caption evaluation Application Program Interface (API). The trained embeddings with AMLMA model generates the convincing radiology report and outperform state-of-the-art (SOTA) approaches with high evaluation metrics scores for Bleu-4 (0.172), Meteor (0.247), Rouge_L (0.376) and CIDEr (0.381). In addition, a new "Unique Index" (UI) statistic is introduced to highlight the model's ability for generating unique reports. Conclusion: The overall architecture aids to the understanding of various X-ray image views and generating the relevant normal and abnormal radiography statements. The proposed model is emphasized on multi-level visual-textual knowledge with adaptive attention mechanism to balance visual and linguistic information for the generation of admissible radiology report.

Deep neural models for automated multi-task diagnostic scan management—quality enhancement, view classification and report generation

MEDICAL IMAGING ENHANCEMENT WITH AI MODELS FOR AUTOMATIC DISEASE DETECTION AND CLASSIFICATION BASED ON MEDICAL IMAGES

Automatic Medical Report Generation Based on Cross-View Attention and Visual-Semantic Long Short Term Memorys

Multi-Disease Diagnosis Using Medical Images

Deep learning models for triaging hospital head MRI examinations

Improving Clinical Diagnosis Performance with Automated X-ray Scan Quality Enhancement Algorithms

Enhancing Radiology Diagnosis through Convolutional Neural Networks for Computer Vision in Healthcare

A deep neural network model for content-based medical image retrieval with multi-view classification

Computer-Aided Diagnosis of Thoracic Diseases in Chest X-rays using hybrid CNN-Transformer Architecture

Annotation of enhanced radiographs for medical image retrieval with deep convolutional neural networks

Automated Radiological Report Generation For Chest X-Rays With Weakly-Supervised End-to-End Deep Learning

Translating medical image to radiological report: Adaptive multilevel multi-attention approach

Automatically Generating Narrative-Style Radiology Reports from Volumetric CT Images; a Proof of Concept

Doctor's Dilemma: Evaluating an Explainable Subtractive Spatial Lightweight Convolutional Neural Network for Brain Tumor Diagnosis

Automated chest screening based on a hybrid model of transfer learning and convolutional sparse denoising autoencoder

Attention based automated radiology report generation using CNN and LSTM

AEDAMIDL: An Enhanced and Discriminant Analysis of Medical Images using Deep Learning

Automated Radiological Impression Generation For Plain Chest X-Rays With End To End Deep Learning

Deep Learning for Report Generation on Chest X-ray Images

Deep Learning Approaches for Medical Image Analysis and Diagnosis

Performance Investigation for Medical Image Evaluation and Diagnosis Using Machine-Learning and Deep-Learning Techniques