Cancer-Net PCa-Gen: Synthesis of Realistic Prostate Diffusion Weighted Imaging Data via Anatomic-Conditional Controlled Latent Diffusion

Aditya Sridhar,Chi-en Amy Tai,Hayden Gunraj,Yuhao Chen,Alexander Wong
2023-11-30
Abstract:In Canada, prostate cancer is the most common form of cancer in men and accounted for 20% of new cancer cases for this demographic in 2022. Due to recent successes in leveraging machine learning for clinical decision support, there has been significant interest in the development of deep neural networks for prostate cancer diagnosis, prognosis, and treatment planning using diffusion weighted imaging (DWI) data. A major challenge hindering widespread adoption in clinical use is poor generalization of such networks due to scarcity of large-scale, diverse, balanced prostate imaging datasets for training such networks. In this study, we explore the efficacy of latent diffusion for generating realistic prostate DWI data through the introduction of an anatomic-conditional controlled latent diffusion strategy. To the best of the authors' knowledge, this is the first study to leverage conditioning for synthesis of prostate cancer imaging. Experimental results show that the proposed strategy, which we call Cancer-Net PCa-Gen, enhances synthesis of diverse prostate images through controllable tumour locations and better anatomical and textural fidelity. These crucial features make it well-suited for augmenting real patient data, enabling neural networks to be trained on a more diverse and comprehensive data distribution. The Cancer-Net PCa-Gen framework and sample images have been made publicly available at <a class="link-external link-https" href="https://www.kaggle.com/datasets/deetsadi/cancer-net-pca-gen-dataset" rel="external noopener nofollow">this https URL</a> as a part of a global open-source initiative dedicated to accelerating advancement in machine learning to aid clinicians in the fight against cancer.
Image and Video Processing,Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
This paper attempts to solve the main problems encountered when applying prostate cancer diffusion - weighted imaging (DWI) data in deep - learning models, namely **the lack of training data and insufficient diversity**. Specifically: 1. **Challenges in Prostate Cancer Diagnosis**: - Prostate cancer is the most common type of cancer among men in Canada, accounting for 20% of newly diagnosed cancer cases in 2022. - Although significant progress has been made in using machine learning for clinical decision support, the wide adoption of these models in actual clinical applications has been limited, mainly due to the lack of large - scale, diverse, and balanced prostate imaging datasets. 2. **Limitations of Existing Methods**: - Current generative models for generating medical images (such as DDPMs) often have inaccurate anatomical structures in the generated images due to their randomness, resulting in unstable performance of the trained neural networks and being unable to meet the needs of medical decision - making. 3. **The Proposed New Method**: - To solve the above problems, the authors proposed **Cancer - Net PCa - Gen**, which is a **anatomic - conditional controlled latent diffusion** - based model, aiming to generate realistic prostate DWI data. - By introducing anatomic - conditional control, this model can better control the tumor location when generating images and improve the authenticity of anatomy and texture, thereby generating more diverse and high - quality synthetic images. 4. **Objectives and Significance**: - By generating diverse synthetic data, Cancer - Net PCa - Gen can enhance the diversity of real - patient data, enabling neural networks to be trained on a broader and more comprehensive data distribution. - This will help improve the generalization ability and accuracy of deep - learning models in prostate cancer diagnosis, prognosis, and treatment planning, and ultimately assist clinicians in better dealing with prostate cancer. In conclusion, the core issue of this paper is to overcome the problems of insufficient training data and lack of diversity faced by existing deep - learning models in prostate cancer research by generating high - quality synthetic prostate DWI data.