Residual Aligner Network

Jian-Qing Zheng,Ziyang Wang,Baoru Huang,Ngee Han Lim,Bartlomiej W. Papiez
DOI: https://doi.org/10.1016/j.media.2023.103038
2022-03-08
Abstract:Image registration is important for medical imaging, the estimation of the spatial transformation between different images. Many previous studies have used learning-based methods for coarse-to-fine registration to efficiently perform 3D image registration. The coarse-to-fine approach, however, is limited when dealing with the different motions of nearby objects. Here we propose a novel Motion-Aware (MA) structure that captures the different motions in a region. The MA structure incorporates a novel Residual Aligner (RA) module which predicts the multi-head displacement field used to disentangle the different motions of multiple neighbouring objects. Compared with other deep learning methods, the network based on the MA structure and RA module achieve one of the most accurate unsupervised inter-subject registration on the 9 organs of assorted sizes in abdominal CT scans, with the highest-ranked registration of the veins (Dice Similarity Coefficient / Average surface distance: 62\%/4.9mm for the vena cava and 34\%/7.9mm for the portal and splenic vein), with a half-sized structure and more efficient computation. Applied to the segmentation of lungs in chest CT scans, the new network achieves results which were indistinguishable from the best-ranked networks (94\%/3.0mm). Additionally, the theorem on predicted motion pattern and the design of MA structure are validated by further analysis.
Image and Video Processing,Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The problem that this paper attempts to solve is how to more accurately capture and process different motion patterns of different adjacent objects or organs in medical image registration. Specifically, the authors propose a novel Motion - Aware (MA) structure and a Residual Aligner (RA) module to address the limitations of existing methods in handling complex motions (such as sliding motions), especially when different motions between different adjacent objects are involved. These problems are particularly prominent in traditional coarse - to - fine registration methods based on feature pyramids, which have bottlenecks in estimating different motion patterns and it is difficult to balance the relationship between similarity measurement and deformation rationality. ### Main contributions: 1. **New Motion - Aware (MA) structure**: Utilize high - resolution feature maps and dilated convolution to enhance the network's ability to predict different motion patterns. 2. **Residual Aligner (RA) module**: Use confidence and multi - head mechanism to align based on the semantic information of the image. 3. **Efficient coarse - to - fine, motion - aware unsupervised registration**: Achieve the state - of - the - art registration accuracy on publicly available lung and abdominal CT datasets, especially performing extremely well when dealing with small organs (such as veins). 4. **Theoretical analysis**: Verify the predicted motion patterns and the design of the MA structure through further analysis. ### Specific problems solved: - **Separation of motions of different adjacent objects**: Traditional methods are prone to confusing the motions of different adjacent objects when dealing with low - resolution feature maps, especially in edge regions. The MA structure proposed in the paper solves this problem by increasing the resolution of the feature maps. - **Bottlenecks in coarse - to - fine registration**: In the process of layer - by - layer refinement, traditional methods are difficult to accurately capture large - range motions due to the limitation of the resolution of feature maps. By introducing dilated convolution and high - resolution feature maps, the paper expands the motion capture range and improves the registration accuracy. - **Efficient computation**: Compared with other methods, the method proposed in the paper reduces the number of parameters and computational cost while maintaining high accuracy, achieving more efficient registration. ### Experimental results: - On the abdominal CT dataset, RAN performs excellently in the registration of multiple organs (such as hepatic veins, portal veins, and splenic veins), achieving the highest Dice Similarity Coefficient (DSC) and the lowest Average Surface Distance (ASD). - On the chest CT dataset, RAN also achieves results comparable to the best methods in lung registration. In conclusion, through proposing a new motion - aware structure and a residual aligner module, this paper effectively solves the problem of motion separation of different adjacent objects in medical image registration and achieves efficient and accurate unsupervised registration.