Abstract:Chest X-Ray (CXR) is a widely used clinical imaging modality and has a pivotal role in the diagnosis and prognosis of various lung and heart related conditions. Conventional automated clinical diagnostic tool design strategies relying on radiology reads and supervised learning, entail the cumbersome requirement of high quality annotated training data. To address this challenge, self-supervised pre-training has proven to outperform supervised pre-training in numerous downstream vision tasks, representing a significant breakthrough in the field. However, medical imaging pre-training significantly differs from pre-training with natural images (e.g., ImageNet) due to unique attributes of clinical images. In this context, we introduce Diverse Concept Modeling (DiCoM), a novel self-supervised training paradigm that leverages a student teacher framework for learning diverse concepts and hence effective representation of the CXR data. Hence, expanding beyond merely modeling a single primary label within an image, instead, effectively harnessing the information from all the concepts inherent in the CXR. The pre-trained model is subsequently fine-tuned to address diverse domain-specific tasks. Our proposed paradigm consistently demonstrates robust performance across multiple downstream tasks on multiple datasets, highlighting the success and generalizability of the pre-training strategy. To establish the efficacy of our methods we analyze both the power of learned representations and the speed of convergence (SoC) of our models. For diverse data and tasks, DiCoM is able to achieve in most cases better results compared to other state-of-the-art pre-training strategies. This when combined with the higher SoC and generalization capabilities positions DiCoM to be established as a foundation model for CXRs, a widely used imaging modality.

MoCo-CXR: MoCo Pretraining Improves Representation and Transferability of Chest X-ray Models

Enhancing representation in radiography-reports foundation model: a granular alignment algorithm using masked contrastive learning

DiCoM -- Diverse Concept Modeling towards Enhancing Generalizability in Chest X-Ray Studies

MoRE: Multi-Modal Contrastive Pre-training with Transformers on X-Rays, ECGs, and Diagnostic Report

Contrastive Learning with Temporal Correlated Medical Images: A Case Study using Lung Segmentation in Chest X-Rays

Contrastive Cross-Modal Pre-Training: A General Strategy for Small Sample Medical Imaging

ReCo-CXR: A Self-Supervised Pre-Training Framework for Pulmonary Nodule Detection in X-Ray Images

Robust image representations with counterfactual contrastive learning

Momentum Contrast for Unsupervised Visual Representation Learning

Multimodal masked siamese network improves chest X-ray representation learning

Contrastive learning with token projection for Omicron pneumonia identification from few-shot chest CT images

Cxrmim: masked image modeling pre-training paradigm for chest x-ray images analysis

Improving CXR Self-Supervised Representation by Pretext Task and Cross-Domain Synthetic Data

MGI: Multimodal Contrastive pre-training of Genomic and Medical Imaging

Radiology Reports Improve Visual Representations Learned from Radiographs

Multi-modal Masked Siamese Network Improves Chest X-Ray Representation Learning

Molecule-Morphology Contrastive Pretraining for Transferable Molecular Representation

Learning Generalized Medical Image Representations through Image-Graph Contrastive Pretraining

UniMoCo: Unsupervised, Semi-Supervised and Full-Supervised Visual Representation Learning.