A Novel Cross-Layer Dual Encoding-Shared Decoding Network Framework with Spatial Self-Attention Mechanism for Hippocampus Segmentation

Jia-Ni Li,Shao-Wu Zhang,Yan-Rui Qiang,Qin-Yi Zhou
DOI: https://doi.org/10.1016/j.compbiomed.2023.107584
IF: 7.7
2023-01-01
Computers in Biology and Medicine
Abstract:Accurate segmentation of the hippocampus from the brain magnetic resonance images (MRIs) is a crucial task in the neuroimaging research, since its structural integrity is strongly related to several neurodegenerative disorders, such as Alzheimer’s disease (AD). Automatic segmentation of the hippocampus structures is challenging due to the small volume, complex shape, low contrast and discontinuous boundaries of hippocampus. Although some methods have been developed for the hippocampus segmentation, most of them paid too much attention to the hippocampus shape and volume instead of considering the spatial information. Additionally, the extracted features are independent of each other, ignoring the correlation between the global and local information. In view of this, here we proposed a novel cross-layer dual Encoding-Shared Decoding network framework with Spatial self-Attention mechanism (called ESDSA) for hippocampus segmentation in human brains. Considering that the hippocampus is a relatively small part in MRI, we introduced the spatial self-attention mechanism in ESDSA to capture the spatial information of hippocampus for improving the segmentation accuracy. We also designed a cross-layer dual encoding-shared decoding network to effectively extract the global information of MRIs and the spatial information of hippocampus. The spatial features of hippocampus and the features extracted from the MRIs were combined to realize the hippocampus segmentation. Results on the baseline T1-weighted structural MRI data show that the performance of our ESDSA is superior to other state-of-the-art methods, and the dice similarity coefficient of ESDSA achieves 89.37%. In addition, the dice similarity coefficient of the Spatial Self-Attention mechanism (SSA) strategy and the dual Encoding-Shared Decoding (ESD) strategy is 9.47%, 5.35% higher than that of the baseline U-net, respectively, indicating that the strategies of SSA and ESD can effectively enhance the segmentation accuracy of human brain hippocampus.
What problem does this paper attempt to address?