Task-Specific Data Preparation for Deep Learning to Reconstruct Structures of Interest from Severely Truncated CBCT Data

Yixing Huang, Fuxin Fan, Ahmed Gomaa, Andreas Maier, Rainer Fietkau, Christoph Bert, Florian Putz
2024-09-13
Abstract: Cone-beam computed tomography (CBCT) is widely used in interventional surgeries and radiation oncology. Due to the limited size of flat-panel detectors, anatomical structures might be missing outside the limited field-of-view (FOV), which restricts the clinical applications of CBCT systems. Recently, deep learning methods have been proposed to extend the FOV for multi-slice CT systems. However, in mobile CBCT systems with a smaller FOV size, projection data is severely truncated and it is challenging for a network to restore all missing structures outside the FOV. In some applications, only certain structures outside the FOV are of interest, e.g., ribs in needle path planning for liver/lung cancer diagnosis. Therefore, a task-specific data preparation method is proposed in this work, which automatically lets the network focus on structures of interest instead of all the structures. Our preliminary experiment shows that Pix2pixGAN with conventional training risks reconstructing false positive and false negative rib structures from severely truncated CBCT data, whereas Pix2pixGAN with the proposed task-specific training can reconstruct all the ribs reliably. The proposed method is promising to empower CBCT with more clinical applications.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
This paper asks how deep learning can reconstruct structures of interest (SOI) that lie outside the field-of-view (FOV) when cone-beam computed tomography (CBCT) data are severely truncated. Because the flat-panel detector of a CBCT system is limited in size, anatomical structures may lie partially or completely outside the FOV, which limits the clinical applications of CBCT. The paper proposes a task-specific data preparation method that lets the neural network concentrate on reconstructing specific structures of interest (such as ribs) instead of trying to restore every missing structure. This matters in clinical scenarios such as image-guided needle biopsy, where the ribs must be reconstructed accurately to plan a safe needle-insertion path.

### Main contributions of the paper

1. **Task-specific data preparation method**: The training data are prepared so that the neural network focuses on reconstructing specific structures of interest (such as ribs) and ignores other, unimportant structures.
2. **Improved reconstruction accuracy**: In the preliminary experiment, the Pix2pixGAN model trained with task-specific data reconstructs the number and position of the ribs more accurately, whereas conventional training may produce false-positive and false-negative rib structures.
3. **Potential clinical applications**: The method is expected to open CBCT to more clinical applications, especially tasks that require accurate reconstruction of specific structures.
### Formula representation

The data preparation method mentioned in the paper can be represented by the following formulas:

- **Traditional data preparation method**:

\[ f_{\text{input}} = R(A_{TP} \cdot f) \]

\[ f_{\text{label, conventional}} = R(A_{UTP} \cdot f) \]

- **Task-specific data preparation method**:

\[ f_{\text{input}} = R(A_{TP} \cdot f) = R(A_{TP}(f_{\text{Others}})) + R(A_{TP}(f_{\text{SOI}})) \]

\[ f_{\text{label, task-specific}} = R(A_{TP}(f_{\text{Others}})) + R(A_{UTP}(f_{\text{SOI}})) \]

where:

- \( f \) is the complete reference 3D image.
- \( f_{\text{input}} \) is the network input.
- \( f_{\text{label, conventional}} \) and \( f_{\text{label, task-specific}} \) are the traditional and task-specific network output labels respectively.
- \( R \) represents the image reconstruction operation.
- \( A_{TP} \) and \( A_{UTP} \) are the truncated and untruncated forward projection operators respectively.
- \( f_{\text{SOI}} \) and \( f_{\text{Others}} \) are the segmented structures of interest and other structures respectively.

Through this task-specific data preparation method, the neural network can focus more on reconstructing specific structures of interest, thereby improving the accuracy and robustness of reconstruction.
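
The formulas above can be illustrated with a small numerical sketch. It is not the paper's implementation: `forward_project`, `reconstruct`, and `prepare_pair` are hypothetical names, the projector is a toy two-view parallel-beam sum, not a CBCT geometry, and the reconstruction is plain unfiltered backprojection. The sketch also assumes that \( f = f_{\text{Others}} + f_{\text{SOI}} \) and that both operators are linear, which is what lets the input split into the two terms shown above.

```python
import numpy as np


def forward_project(image, detector_halfwidth=None):
    """Toy stand-in for A_UTP (halfwidth=None) or A_TP (halfwidth given).

    Assumption: two parallel views (column sums and row sums) of a square image.
    Truncation keeps only the central detector bins and zeroes the rest.
    """
    views = np.stack([image.sum(axis=0), image.sum(axis=1)])  # shape (2, N)
    if detector_halfwidth is not None:
        n = views.shape[1]
        center = n // 2
        keep = np.zeros(n, dtype=bool)
        keep[center - detector_halfwidth:center + detector_halfwidth] = True
        views = views * keep  # bins outside the small detector are lost
    return views


def reconstruct(views):
    """Toy stand-in for R: unfiltered backprojection of the two views (linear)."""
    n = views.shape[1]
    # Smear each view back along the direction it was summed over.
    return (views[0][None, :] + views[1][:, None]) / (2 * n)


def prepare_pair(f, soi_mask, detector_halfwidth):
    """Return (network input, conventional label, task-specific label)."""
    f_soi = np.where(soi_mask, f, 0.0)     # segmented structures of interest
    f_others = np.where(soi_mask, 0.0, f)  # everything else

    f_input = reconstruct(forward_project(f, detector_halfwidth))
    label_conventional = reconstruct(forward_project(f))
    # Others stay truncated, only the SOI is projected without truncation.
    label_task = (reconstruct(forward_project(f_others, detector_halfwidth))
                  + reconstruct(forward_project(f_soi)))
    return f_input, label_conventional, label_task


# Hypothetical toy volume slice with a small "rib" patch outside the FOV.
f = np.random.default_rng(0).random((16, 16))
soi_mask = np.zeros((16, 16), dtype=bool)
soi_mask[1:3, 1:3] = True
f_input, label_conventional, label_task = prepare_pair(f, soi_mask, 4)
```

With this construction, the task-specific label differs from the input only in the SOI term, \( R(A_{UTP}(f_{\text{SOI}})) - R(A_{TP}(f_{\text{SOI}})) \), so the residual the network has to learn contains only the structures of interest.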