AttentNet: Fully Convolutional 3D Attention for Lung Nodule Detection

Majedaldein Almahasneh,Xianghua Xie,Adeline Paiement
2024-07-20
Abstract:Motivated by the increasing popularity of attention mechanisms, we observe that popular convolutional (conv.) attention models like Squeeze-and-Excite (SE) and Convolutional Block Attention Module (CBAM) rely on expensive multi-layer perception (MLP) layers. These MLP layers significantly increase computational complexity, making such models less applicable to 3D image contexts, where data dimensionality and computational costs are higher. In 3D medical imaging, such as 3D pulmonary CT scans, efficient processing is crucial due to the large data volume. Traditional 2D attention generalized to 3D increases the computational load, creating demand for more efficient attention mechanisms for 3D tasks. We investigate the possibility of incorporating fully convolutional (conv.) attention in 3D context. We present two 3D fully conv. attention blocks, demonstrating their effectiveness in 3D context. Using pulmonary CT scans for 3D lung nodule detection, we present AttentNet, an automated lung nodule detection framework from CT images, performing detection as an ensemble of two stages, candidate proposal and false positive (FP) reduction. We compare the proposed 3D attention blocks to popular 2D conv. attention methods generalized to 3D modules and to self-attention units. For the FP reduction stage, we also use a joint analysis approach to aggregate spatial information from different contextual levels. We use LUNA-16 lung nodule detection dataset to demonstrate the benefits of the proposed fully conv. attention blocks compared to baseline popular lung nodule detection methods when no attention is used. Our work does not aim at achieving state-of-the-art results in the lung nodule detection task, rather to demonstrate the benefits of incorporating fully conv. attention within a 3D context.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The main problem that this paper attempts to solve is to improve the efficiency and accuracy of 3D lung nodule detection, especially to reduce the computational complexity and the false - positive rate. Specifically: 1. **Computational complexity problem**: Existing convolutional attention models (such as Squeeze - and - Excite (SE) and Convolutional Block Attention Module (CBAM)) rely on multi - layer perceptron (MLP) layers, which significantly increase the computational complexity and make these models inapplicable when dealing with 3D images. The data dimension and computational cost of 3D medical images (such as 3D lung CT scans) are high, so a more efficient attention mechanism is required. 2. **False - positive reduction problem**: In lung nodule detection, the traditional two - stage methods (candidate proposal and false - positive reduction) usually produce a high false - positive rate. The paper proposes a joint analysis method to aggregate spatial information at different context levels in order to reduce the false - positive rate. ### Specific objectives of the paper 1. **Introduce a fully convolutional attention mechanism**: The paper proposes two 3D fully convolutional attention blocks for efficiently inferring spatial correlations across channels and across slices. These attention blocks aim to improve the performance of the 3D lung nodule detection task and their effectiveness is verified through experiments. 2. **Improve the false - positive reduction stage**: The paper adopts a joint analysis method to simultaneously aggregate spatial information from different context levels in order to improve the performance of the false - positive reduction stage. In addition, a zoom - in convolutional path is proposed to help the network capture multi - scale spatial embeddings at different scales. 3. **Evaluate different attention mechanisms**: The paper conducts extensive experiments on different existing attention mechanisms and compares them with the proposed fully convolutional attention blocks to evaluate their performance in the 3D lung nodule detection framework. ### Main contributions 1. **Propose two 3D fully convolutional attention blocks**: These attention blocks can effectively infer spatial correlations across channels and across slices and are verified on the LUNA16 dataset, demonstrating their potential in the lung nodule detection task. 2. **Adopt a joint analysis method**: By aggregating spatial information at different context levels, the performance of the false - positive reduction stage is improved, and a zoom - in convolutional path is proposed to enhance the effect of the final prediction. ### Summary The main purpose of the paper is to solve the computational complexity and false - positive rate problems in 3D lung nodule detection by introducing an efficient fully convolutional attention mechanism, thereby improving the accuracy and efficiency of detection.