DiffuX2CT: Diffusion Learning to Reconstruct CT Images from Biplanar X-Rays

Xuhui Liu, Zhi Qiao, Runkun Liu, Hong Li, Juan Zhang, Xiantong Zhen, Zhen Qian, Baochang Zhang
2024-07-18
Abstract: Computed tomography (CT) is widely utilized in clinical settings because it delivers detailed 3D images of the human body. However, performing CT scans is not always feasible due to radiation exposure and limitations in certain surgical environments. As an alternative, reconstructing CT images from ultra-sparse X-rays offers a valuable solution and has gained significant interest in scientific research and medical applications. However, it presents great challenges as it is inherently an ill-posed problem, often compromised by artifacts resulting from overlapping structures in X-ray images. In this paper, we propose DiffuX2CT, which models CT reconstruction from orthogonal biplanar X-rays as a conditional diffusion process. DiffuX2CT is established with a 3D global coherence denoising model with a new, implicit conditioning mechanism. We realize the conditioning mechanism by a newly designed tri-plane decoupling generator and an implicit neural decoder. By doing so, DiffuX2CT achieves structure-controllable reconstruction, which enables 3D structural information to be recovered from 2D X-rays, therefore producing faithful textures in CT images. As an extra contribution, we collect a real-world lumbar CT dataset, called LumbarV, as a new benchmark to verify the clinical significance and performance of CT reconstruction from X-rays. Extensive experiments on this dataset and three more publicly available datasets demonstrate the effectiveness of our proposal.
Image and Video Processing, Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
This paper primarily aims to address the following issues:

1. **Reconstructing high-quality CT images from biplanar X-rays**:
   - Computed Tomography (CT) is widely used in clinical settings to obtain detailed 3D images of the human body. However, CT scans are not always feasible in certain surgical scenarios because of radiation exposure and other limitations.
   - As an alternative, reconstructing CT images from ultra-sparse X-ray images offers a valuable solution, but this is inherently an ill-posed problem, and overlapping structures in the X-ray images often lead to artifacts.
2. **Proposing a new conditional diffusion model, DiffuX2CT**:
   - The paper introduces a new 3D conditional diffusion model called DiffuX2CT, which models the CT reconstruction task as a conditional denoising diffusion process, with orthogonal biplanar X-rays as the condition.
   - Through a new implicit conditioning mechanism (ICM), DiffuX2CT can recover 3D structural information from 2D X-rays and thereby generate high-fidelity CT images.
3. **Validating clinical significance**:
   - To validate the clinical significance of CT reconstruction from biplanar X-rays, the authors collected a new real-world lumbar CT dataset (LumbarV), containing 268 3D CT images from different patients. Each patient has implants. Experimental results show that DiffuX2CT can generate high-quality CT images, aiding surgeons in locating implants and measuring vertebral shapes.

Through these efforts, the paper aims to provide an efficient and accurate method for reconstructing high-quality CT images from biplanar X-rays, thereby reducing reliance on CT scans and minimizing radiation exposure risks.
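
To make the phrase "conditional denoising diffusion process" more concrete, the sketch below shows one training step of a generic DDPM-style model conditioned on two orthogonal views. It is an illustration under stated assumptions, not the authors' implementation: the linear noise schedule, the mean-intensity projections standing in for X-rays, and the `toy_denoiser` placeholder (which a real system would replace with a learned 3D network and, in DiffuX2CT, the implicit conditioning mechanism) are all hypothetical choices made for this example.

```python
import numpy as np


def make_schedule(num_steps=1000, beta_start=1e-4, beta_end=2e-2):
    """Linear noise schedule (an assumption); returns cumulative alpha_bar_t."""
    betas = np.linspace(beta_start, beta_end, num_steps)
    return np.cumprod(1.0 - betas)


def biplanar_projections(volume):
    """Toy stand-in for orthogonal X-rays: mean intensity along two axes.

    Real X-ray formation is more involved; this only mimics the 3D -> 2D
    collapse that makes the reconstruction problem ill-posed.
    """
    frontal = volume.mean(axis=1)  # shape (D, W)
    lateral = volume.mean(axis=2)  # shape (D, H)
    return frontal, lateral


def q_sample(x0, t, alpha_bar, rng):
    """Forward diffusion: x_t = sqrt(a_bar_t) * x0 + sqrt(1 - a_bar_t) * eps."""
    eps = rng.standard_normal(x0.shape)
    x_t = np.sqrt(alpha_bar[t]) * x0 + np.sqrt(1.0 - alpha_bar[t]) * eps
    return x_t, eps


def toy_denoiser(x_t, t, frontal, lateral):
    """Hypothetical placeholder for a learned conditional noise predictor."""
    # Broadcast both 2D views back into 3D as a crude conditioning signal.
    cond = 0.5 * (frontal[:, None, :] + lateral[:, :, None])
    return x_t - cond


def training_loss(x0, t, alpha_bar, rng):
    """Noise-prediction MSE for one CT volume at timestep t."""
    frontal, lateral = biplanar_projections(x0)
    x_t, eps = q_sample(x0, t, alpha_bar, rng)
    eps_hat = toy_denoiser(x_t, t, frontal, lateral)
    return float(np.mean((eps_hat - eps) ** 2))


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    alpha_bar = make_schedule()
    ct_volume = rng.random((16, 16, 16))  # synthetic volume, not real CT data
    print(training_loss(ct_volume, t=500, alpha_bar=alpha_bar, rng=rng))
```

At inference time a model of this kind starts from pure noise and repeatedly denoises while holding the two X-ray views fixed, so the quality of the result depends on how well the conditioning signal carries 3D structure.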