A foundation model for generalizable disease diagnosis in chest X-ray images

Lijian Xu,Ziyu Ni,Hao Sun,Hongsheng Li,Shaoting Zhang
2024-10-11
Abstract:Medical artificial intelligence (AI) is revolutionizing the interpretation of chest X-ray (CXR) images by providing robust tools for disease diagnosis. However, the effectiveness of these AI models is often limited by their reliance on large amounts of task-specific labeled data and their inability to generalize across diverse clinical settings. To address these challenges, we introduce CXRBase, a foundational model designed to learn versatile representations from unlabelled CXR images, facilitating efficient adaptation to various clinical tasks. CXRBase is initially trained on a substantial dataset of 1.04 million unlabelled CXR images using self-supervised learning methods. This approach allows the model to discern meaningful patterns without the need for explicit labels. After this initial phase, CXRBase is fine-tuned with labeled data to enhance its performance in disease detection, enabling accurate classification of chest diseases. CXRBase provides a generalizable solution to improve model performance and alleviate the annotation workload of experts to enable broad clinical AI applications from chest imaging.
Image and Video Processing,Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The paper aims to address two major issues in the diagnosis of diseases using chest X-ray images: 1. **Dependency on Data Annotation**: Existing AI models typically require a large amount of task-specific annotated data for effective training, which limits the practicality and scalability of the models. 2. **Lack of Generalization Ability**: Existing models have poor adaptability and generalization ability in different clinical environments. To address these issues, the research team introduced a foundational model named CXRBase. This model employs a self-supervised learning approach to pre-train on a large number of unannotated chest X-ray images, thereby learning general and robust feature representations. Specifically, CXRBase was first pre-trained on over 1 million unannotated chest X-ray images and then fine-tuned on a small amount of annotated data to enhance its performance in disease detection. This approach not only reduces the reliance on expert annotation but also significantly improves the model's performance across various clinical tasks, particularly demonstrating excellent performance in disease classification and localization tasks.