INR-LDDMM: Fluid-based Medical Image Registration Integrating Implicit Neural Representation and Large Deformation Diffeomorphic Metric Mapping

Chulong Zhang,Xiaokun Liang
2023-11-27
Abstract:We propose a fluid-based registration framework of medical images based on implicit neural representation. By integrating implicit neural representation and Large Deformable Diffeomorphic Metric Mapping (LDDMM), we employ a Multilayer Perceptron (MLP) as a velocity generator while optimizing velocity and image similarity. Moreover, we adopt a coarse-to-fine approach to address the challenge of deformable-based registration methods dropping into local optimal solutions, thus aiding the management of significant deformations in medical image registration. Our algorithm has been validated on a paired CT-CBCT dataset of 50 patients,taking the Dice coefficient of transferred annotations as an evaluation metric. Compared to existing methods, our approach achieves the state-of-the-art performance.
Image and Video Processing
What problem does this paper attempt to address?
The problem that this paper attempts to solve is how to more effectively combine Implicit Neural Representation (INR) with Large Deformation Diffeomorphic Metric Mapping (LDDMM) in medical image registration to improve registration accuracy and efficiency. Specifically, the paper proposes a fluid - based medical image registration framework - INR - LDDMM, aiming to optimize the velocity field and image similarity by integrating INR and LDDMM, and adopts a coarse - to - fine approach to solve the problem that deformation - based registration methods are prone to fall into local optimal solutions, so as to better handle significant deformations in medical images. ### Main contributions of the paper: 1. **Implicit neural representation for time - space - related velocity field estimation is proposed for the first time**: By combining implicit neural representation with LDDMM, efficient estimation of the velocity field is achieved. 2. **The speed of INR - LDDMM is optimized**: A coarse - to - fine framework is adopted to gradually optimize the velocity field and avoid falling into local optimal solutions prematurely. 3. **Verification is carried out on a CT - CBCT paired data set of 50 patients**: The results show that this method has reached the current best level in registration accuracy. ### Technical details: - **Problem formalization**: In the LDDMM framework, the spatial transformation \(\phi\) of the image is achieved by calculating the space - time velocity field \(v(t, x)\), and the objective function is minimized by optimizing the velocity field and image similarity. - The objective function \(L\) is defined as: \[ L=\arg\min_v\left(\text{Sim}(I_1(1), I_2)+\frac{1}{2}\int_0^1\|v(t)\|^2_Ldt\right) \] where \(\text{Sim}(I_1(1), I_2)\) represents the similarity measurement between the moving image \(I_1\) and the fixed image \(I_2\), and \(v(t)\) is the velocity field. - **INR - LDDMM framework**: A three - layer multi - layer perceptron (MLP) network is used to predict the local velocity at any position in a 3D image. The input is the position and time \(X=(x, y, z, t)\), and the output is the velocity \(V=(v_x, v_y, v_z)\). Through iterative updating of the position and time, the displacement field \(S\) is finally obtained. - The calculation formula of the displacement field \(S\) is: \[ S = \sum\Delta P_t=\sum(\partial_x\Delta t,\partial_y\Delta t,\partial_z\Delta t) \] - **Coarse - to - fine optimization framework**: First, a rough displacement field \(S_1\) is generated at a low resolution, then it is converted into a high - resolution displacement field \(S'_1\) through bilinear interpolation, and then fine - tuned on this basis, and finally the high - resolution displacement field \(S_f\) is obtained. ### Experimental results: - **Data set**: A CT - CBCT paired data set of 50 patients who received radiotherapy after breast - conserving surgery for breast cancer was used. The size of each image is 256×256×96. - **Experimental comparison**: Compared with traditional iterative methods and deep - learning - based methods, the results show that INR - LDDMM is superior to other methods in the Dice coefficient (DSC) of multiple structures. ### Conclusion: The INR - LDDMM algorithm proposed in this study performs well in both registration accuracy and efficiency. Especially when dealing with medical images with significant deformations, it can effectively avoid the problem of local optimal solutions. Through verification on actual data sets, the effectiveness and superiority of this method are proved.