An Improved Bounding Box Post-processing Algorithm with Faster R-CNN for High Spatial Resolution Remote Sensing Imagery Object Detection

Yanfei Zhong, Xiaobing Han, Liangpei Zhang
2017-01-01
Abstract: Multi-class geospatial object detection from high spatial resolution (HSR) remote sensing imagery is an important but challenging task. Owing to the powerful feature representation ability of deep learning, object detection from HSR remote sensing imagery has become increasingly effective. Among deep learning based object detection methods, region proposal based methods (e.g., the faster region based convolutional neural network (Faster R-CNN)) are effective: they first generate region proposals from the input imagery, then regress the locations of the bounding boxes and classify the object categories. Properly selecting bounding boxes and accurately suppressing redundant ones is an essential step of Faster R-CNN. Bounding box suppression is traditionally performed by non-maximum suppression (NMS), which retains only the predicted bounding box with the maximum score and deletes the overlapping boxes with non-maximum scores. However, NMS may be ineffective, especially when the objects are densely distributed, the bounding box sizes are inappropriate, or the bounding boxes overlap one another. To alleviate this problem and further improve the detection performance of region proposal based Faster R-CNN frameworks, this paper proposes replacing the traditional NMS in Faster R-CNN with a sigmoid derivative decay function. The proposed algorithm has been validated on a publicly available ten-class object detection dataset.
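The contrast between hard NMS and score decay can be illustrated with a minimal sketch. The abstract does not give the paper's decay formula, so the weight used here, 4·σ'(k·IoU) with a hypothetical steepness k, is only an assumed form chosen so that the weight equals 1 at zero overlap and shrinks as overlap grows. The function names, the thresholds, and the greedy rescoring loop (borrowed from the general soft-NMS idea) are likewise illustrative assumptions, not the authors' implementation.

```python
import math


def iou(a, b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def hard_nms(boxes, scores, iou_thresh=0.5):
    """Traditional NMS: keep the top-scoring box, delete its overlapping neighbours."""
    order = sorted(range(len(boxes)), key=lambda i: scores[i], reverse=True)
    keep = []
    while order:
        best = order.pop(0)
        keep.append(best)
        order = [i for i in order if iou(boxes[best], boxes[i]) <= iou_thresh]
    return keep


def sigmoid_derivative_weight(overlap, steepness=4.0):
    """Assumed decay weight: 4 * s * (1 - s), with s = sigmoid(steepness * overlap).

    The factor 4 normalises the sigmoid derivative so the weight is 1 at overlap 0.
    The exact form and the `steepness` parameter are hypothetical.
    """
    s = 1.0 / (1.0 + math.exp(-steepness * overlap))
    return 4.0 * s * (1.0 - s)


def decay_nms(boxes, scores, score_thresh=0.1, steepness=4.0):
    """Greedy rescoring: down-weight overlapping boxes instead of deleting them."""
    scores = list(scores)  # work on a copy so the caller's scores are untouched
    remaining = list(range(len(boxes)))
    keep = []
    while remaining:
        best = max(remaining, key=lambda i: scores[i])
        remaining.remove(best)
        if scores[best] < score_thresh:
            break  # every remaining box scores lower still
        keep.append((best, scores[best]))
        for i in remaining:
            overlap = iou(boxes[best], boxes[i])
            scores[i] *= sigmoid_derivative_weight(overlap, steepness)
    return keep


# Two heavily overlapping boxes (e.g. adjacent objects) and one isolated box.
boxes = [(0, 0, 10, 10), (3, 0, 13, 10), (50, 50, 60, 60)]
scores = [0.9, 0.8, 0.7]
print(hard_nms(boxes, scores))   # the second box is deleted outright
print(decay_nms(boxes, scores))  # the second box survives with a reduced score
```

In this toy case the second box overlaps the first with an IoU above 0.5, so hard NMS discards it, while the decay variant keeps it with a lower score. That is the behaviour one would want when densely distributed objects produce legitimately overlapping detections.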