Self-improving generative foundation model for synthetic medical image generation and clinical applications
Jinzhuo Wang,Kai Wang,Yunfang Yu,Yuxing Lu,Wenchao Xiao,Zhuo Sun,Fei Liu,Zixing Zou,Yuanxu Gao,Lei Yang,Hong-Yu Zhou,Hanpei Miao,Wenting Zhao,Lisha Huang,Lingchao Zeng,Rui Guo,Ieng Chong,Boyu Deng,Linling Cheng,Xiaoniao Chen,Jing Luo,Meng-Hua Zhu,Daniel Baptista-Hon,Olivia Monteiro,Ming Li,Yu Ke,Jiahui Li,Simiao Zeng,Taihua Guan,Jin Zeng,Kanmin Xue,Eric Oermann,Huiyan Luo,Yun Yin,Kang Zhang,Jia Qu
DOI: https://doi.org/10.1038/s41591-024-03359-y
IF: 82.9
2024-12-13
Nature Medicine
Abstract:In many clinical and research settings, the scarcity of high-quality medical imaging datasets has hampered the potential of artificial intelligence (AI) clinical applications. This issue is particularly pronounced in less common conditions, underrepresented populations and emerging imaging modalities, where the availability of diverse and comprehensive datasets is often inadequate. To address this challenge, we introduce a unified medical image–text generative model called MINIM that is capable of synthesizing medical images of various organs across various imaging modalities based on textual instructions. Clinician evaluations and rigorous objective measurements validate the high quality of MINIM's synthetic images. MINIM exhibits an enhanced generative capability when presented with previously unseen data domains, demonstrating its potential as a generalist medical AI (GMAI). Our findings show that MINIM's synthetic images effectively augment existing datasets, boosting performance across multiple medical applications such as diagnostics, report generation and self-supervised learning. On average, MINIM enhances performance by 12% for ophthalmic, 15% for chest, 13% for brain and 17% for breast-related tasks. Furthermore, we demonstrate MINIM's potential clinical utility in the accurate prediction of HER2-positive breast cancer from MRI images. Using a large retrospective simulation analysis, we demonstrate MINIM's clinical potential by accurately identifying targeted therapy-sensitive EGFR mutations using lung cancer computed tomography images, which could potentially lead to improved 5-year survival rates. Although these results are promising, further validation and refinement in more diverse and prospective settings would greatly enhance the model's generalizability and robustness.
biochemistry & molecular biology,cell biology,medicine, research & experimental