Abstract:Point cloud-based 3-D object detection is a significant and critical issue in numerous applications. While most existing methods attempt to capitalize on the geometric characteristics of point clouds, they neglect the internal semantic properties of point and the consistency between the semantic and geometric clues. We introduce a semantic consistency (SC) mechanism for 3-D object detection in this article, by reasoning about the semantic relations between 3-D object boxes and its internal points. This mechanism is based on a natural principle: the semantic category of a 3-D bounding box should be consistent with the categories of all points within the box. Driven by the SC mechanism, we propose a novel SC network (SCNet) to detect 3-D objects from point clouds. Specifically, the SCNet is composed of a feature extraction module, a detection decision module, and a semantic segmentation module. In inference, the feature extraction and the detection decision modules are used to detect 3-D objects. In training, the semantic segmentation module is jointly trained with the other two modules to produce more robust and applicable model parameters. The performance is greatly boosted through reasoning about the relations between the output 3-D object boxes and segmented points. The proposed SC mechanism is model-agnostic and can be integrated into other base 3-D object detection models. We test the proposed model on three challenging indoor and outdoor benchmark datasets: ScanNetV2, SUN RGB-D, and KITTI. Furthermore, to validate the universality of the SC mechanism, we implement it in three different 3-D object detectors. The experiments show that the performance is impressively improved and the extensive ablation studies also demonstrate the effectiveness of the proposed model.

Semantic R-CNN for Natural Language Object Detection.

A Simultaneous Object Detection and Component Segmentation Approach Based on Mask R-CNN

Natural Language Object Retrieval

Deconv R-Cnn For Small Object Detection On Remote Sensing Images

PG-RCNN: Semantic Surface Point Generation for 3D Object Detection

PV-RCNN++: Semantical Point-Voxel Feature Interaction for 3D Object Detection

R-CNN minus R

LiDAR-only 3D Object Detection Based on Spatial Context

Semantic Consistency Reasoning for 3-D Object Detection in Point Clouds

Specific category region proposal network for text detection in natural scene

Semantic-Context Graph Network for Point-based 3D Object Detection

Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks

Deep feature based contextual model for object detection

R-SNN: Region-Based Spiking Neural Network for Object Detection

RON: Reverse Connection with Objectness Prior Networks for Object Detection

Semantic Analysis System to Recognize Moving Objects by Using a Deep Learning Model

Image Captioning with Object Detection and Localization.

End-to-end Semantic Object Detection with Cross-Modal Alignment

SG-LPR: Semantic-Guided LiDAR-Based Place Recognition

A Refined and Efficient CNN Algorithm for Remote Sensing Object Detection

R-FCN plus plus : Towards Accurate Region-Based Fully Convolutional Networks for Object Detection