Investigating Attention Mechanism in 3D Point Cloud Object Detection

Shi Qiu,Yunfan Wu,Saeed Anwar,Chongyi Li
DOI: https://doi.org/10.48550/arXiv.2108.00620
2021-10-14
Abstract:Object detection in three-dimensional (3D) space attracts much interest from academia and industry since it is an essential task in AI-driven applications such as robotics, autonomous driving, and augmented reality. As the basic format of 3D data, the point cloud can provide detailed geometric information about the objects in the original 3D space. However, due to 3D data's sparsity and unorderedness, specially designed networks and modules are needed to process this type of data. Attention mechanism has achieved impressive performance in diverse computer vision tasks; however, it is unclear how attention modules would affect the performance of 3D point cloud object detection and what sort of attention modules could fit with the inherent properties of 3D data. This work investigates the role of the attention mechanism in 3D point cloud object detection and provides insights into the potential of different attention modules. To achieve that, we comprehensively investigate classical 2D attentions, novel 3D attentions, including the latest point cloud transformers on SUN RGB-D and ScanNetV2 datasets. Based on the detailed experiments and analysis, we conclude the effects of different attention modules. This paper is expected to serve as a reference source for benefiting attention-embedded 3D point cloud object detection. The code and trained models are available at: <a class="link-external link-https" href="https://github.com/ShiQiu0419/attentions_in_3D_detection" rel="external noopener nofollow">this https URL</a>.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The problem that this paper attempts to solve is: in 3D point - cloud object detection, the role of the attention mechanism and its impact on performance are still unclear. Specifically, the author aims to explore how different types of attention modules (including the classic 2D attention modules and the new 3D attention modules) affect the effectiveness of 3D point - cloud object detection, and analyze which attention module is most suitable for handling the inherent characteristics of 3D data. ### Research Background and Problem Description 3D point - cloud object detection is of great significance in AI - driven applications such as autonomous driving, robotics, and augmented reality. However, due to the sparsity and disorder of 3D point - cloud data, traditional 2D object detection methods cannot be directly applied to 3D point - cloud. Therefore, special networks and modules need to be designed to process this type of data. Although the attention mechanism has performed well in various computer vision tasks, its application in 3D point - cloud object detection has not been fully studied. ### Main Research Contents To fill this gap, the author has carried out the following work: 1. **Comprehensive Evaluation**: The author selected two commonly used 3D point - cloud object detection datasets, SUN RGB - D and ScanNetV2, and conducted a detailed evaluation of five classic 2D attention modules and five new 3D attention modules. 2. **Experimental Verification**: Through a large number of experiments on these datasets, the author analyzed the impact of different attention modules on the performance of 3D point - cloud object detection, and summarized the characteristics and advantages of various attention modules. 3. **Propose Insights**: Based on the experimental results, the author provided valuable references for future research, pointing out the potential and limitations of different types of attention modules in 3D point - cloud object detection. ### Main Contributions - **Improve VoteNet**: By integrating the attention mechanism into VoteNet, the author improved its performance in 3D point - cloud object detection. - **Comprehensive Evaluation**: For the first time, a comprehensive evaluation of the performance of ten recent attention modules in 3D point - cloud object detection was carried out. - **Provide Insights**: Specifically summarized the effects and characteristics of different types of attention modules, providing new insights and inspiration for understanding the application of the attention mechanism in 3D point - cloud object detection. ### Conclusion Through the above research, the author not only verified the effectiveness of the attention mechanism in 3D point - cloud object detection, but also provided important references and directions for subsequent research.