Abstract:Image registration is important for medical imaging, the estimation of the spatial transformation between different images. Many previous studies have used learning-based methods for coarse-to-fine registration to efficiently perform 3D image registration. The coarse-to-fine approach, however, is limited when dealing with the different motions of nearby objects. Here we propose a novel Motion-Aware (MA) structure that captures the different motions in a region. The MA structure incorporates a novel Residual Aligner (RA) module which predicts the multi-head displacement field used to disentangle the different motions of multiple neighbouring objects. Compared with other deep learning methods, the network based on the MA structure and RA module achieve one of the most accurate unsupervised inter-subject registration on the 9 organs of assorted sizes in abdominal CT scans, with the highest-ranked registration of the veins (Dice Similarity Coefficient / Average surface distance: 62\%/4.9mm for the vena cava and 34\%/7.9mm for the portal and splenic vein), with a half-sized structure and more efficient computation. Applied to the segmentation of lungs in chest CT scans, the new network achieves results which were indistinguishable from the best-ranked networks (94\%/3.0mm). Additionally, the theorem on predicted motion pattern and the design of MA structure are validated by further analysis.

What problem does this paper attempt to address?

The problem that this paper attempts to solve is how to more accurately capture and process different motion patterns of different adjacent objects or organs in medical image registration. Specifically, the authors propose a novel Motion - Aware (MA) structure and a Residual Aligner (RA) module to address the limitations of existing methods in handling complex motions (such as sliding motions), especially when different motions between different adjacent objects are involved. These problems are particularly prominent in traditional coarse - to - fine registration methods based on feature pyramids, which have bottlenecks in estimating different motion patterns and it is difficult to balance the relationship between similarity measurement and deformation rationality. ### Main contributions: 1. **New Motion - Aware (MA) structure**: Utilize high - resolution feature maps and dilated convolution to enhance the network's ability to predict different motion patterns. 2. **Residual Aligner (RA) module**: Use confidence and multi - head mechanism to align based on the semantic information of the image. 3. **Efficient coarse - to - fine, motion - aware unsupervised registration**: Achieve the state - of - the - art registration accuracy on publicly available lung and abdominal CT datasets, especially performing extremely well when dealing with small organs (such as veins). 4. **Theoretical analysis**: Verify the predicted motion patterns and the design of the MA structure through further analysis. ### Specific problems solved: - **Separation of motions of different adjacent objects**: Traditional methods are prone to confusing the motions of different adjacent objects when dealing with low - resolution feature maps, especially in edge regions. The MA structure proposed in the paper solves this problem by increasing the resolution of the feature maps. - **Bottlenecks in coarse - to - fine registration**: In the process of layer - by - layer refinement, traditional methods are difficult to accurately capture large - range motions due to the limitation of the resolution of feature maps. By introducing dilated convolution and high - resolution feature maps, the paper expands the motion capture range and improves the registration accuracy. - **Efficient computation**: Compared with other methods, the method proposed in the paper reduces the number of parameters and computational cost while maintaining high accuracy, achieving more efficient registration. ### Experimental results: - On the abdominal CT dataset, RAN performs excellently in the registration of multiple organs (such as hepatic veins, portal veins, and splenic veins), achieving the highest Dice Similarity Coefficient (DSC) and the lowest Average Surface Distance (ASD). - On the chest CT dataset, RAN also achieves results comparable to the best methods in lung registration. In conclusion, through proposing a new motion - aware structure and a residual aligner module, this paper effectively solves the problem of motion separation of different adjacent objects in medical image registration and achieves efficient and accurate unsupervised registration.

Residual Aligner Network

Residual Aligner-based Network (RAN): Motion-separable structure for coarse-to-fine discontinuous deformable registration

Real-Time 2D/3D Registration via CNN Regression and Centroid Alignment

Salient deformable network for abdominal multiorgan registration

Anatomically Constrained and Attention-Guided Deep Feature Fusion for Joint Segmentation and Deformable Medical Image Registration.

Structure-aware Registration Network for Liver DCE-CT Images

An Unsupervised Learning-Based Multi-Organ Registration Method for 3D Abdominal CT Images

Robust Fast Inter-Bin Image Registration for Undersampled Coronary MRI Based on a Learned Motion Prior

MA-VoxelMorph: Multi-scale Attention-based VoxelMorph for Non-Rigid Registration of Thoracoabdominal CT Images

Deformable Medical Image Registration with Global-Local Transformation Network and Region Similarity Constraint.

Joint segmentation and discontinuity-preserving deformable registration: Application to cardiac cine-MR images

Networks for Joint Affine and Non-parametric Image Registration

Hierarchical Cumulative Network for Unsupervised Medical Image Registration.

MvMM-RegNet: A new image registration framework based on multivariate mixture model and neural network estimation

NCNet: deformable medical image registration network based on neighborhood cross-attention combined with multi-resolution constraints.

MA-VoxelMorph: Multi-scale attention-based VoxelMorph for nonrigid registration of thoracoabdominal CT images

F3RNet: Full-Resolution Residual Registration Network for Deformable Image Registration

An efficient two-step multi-organ registration on abdominal CT via deep-learning based segmentation

Abdominal CT-CBCT Deformable Image Registration Using Deep Neural Network with Directional Local Structural Similarity

3D Biological/Biomedical Image Registration with enhanced Feature Extraction and Outlier Detection

CCMNet: Cross-scale correlation-aware mapping network for 3D lung CT image registration