Abstract:The objective of detection in remote sensing images is to determine the location and category of all targets in these images. The anchor based methods are the most prevalent deep learning based methods, and still have some problems that need to be addressed. First, the existing metric (i.e., intersection over union (IoU)) could not measure the distance between two bounding boxes when they are nonoverlapping. Second, the exsiting bounding box regression loss could not directly optimize the metric in the training process. Third, the existing methods which adopt a hierarchical deep network only choose a single level feature layer for the feature extraction of region proposals, meaning they do not take full use of the advantage of multi-level features. To resolve the above problems, a novel object detection method for remote sensing images based on improved bounding box regression and multi-level features fusion is proposed in this paper. First, a new metric named generalized IoU is applied, which can quantify the distance between two bounding boxes, regardless of whether they are overlapping or not. Second, a novel bounding box regression loss is proposed, which can not only optimize the new metric (i.e., generalized IoU) directly but also overcome the problem that existing bounding box regression loss based on the new metric cannot adaptively change the gradient based on the metric value. Finally, a multi-level features fusion module is proposed and incorporated into the existing hierarchical deep network, which can make full use of the multi-level features for each region proposal. The quantitative comparisons between the proposed method and baseline method on the large scale dataset DIOR demonstrate that incorporating the proposed bounding box regression loss, multi-level features fusion module, and a combination of both into the baseline method can obtain an absolute gain of 0.7%, 1.4%, and 2.2% or so in terms of mAP, respectively. Comparing this with the state-of-the-art methods demonstrates that the proposed method has achieved a state-of-the-art performance. The curves of average precision with different thresholds show that the advantage of the proposed method is more evident when the threshold of generalized IoU (or IoU) is relatively high, which means that the proposed method can improve the precision of object localization. Similar conclusions can be obtained on a NWPU VHR-10 dataset.

An Improved Bounding Box Post-processing Algorithm with Faster R-CNN for High Spatial Resolution Remote Sensing Imagery Object Detection

A Simultaneous Object Detection and Component Segmentation Approach Based on Mask R-CNN

An Improved Faster R-CNN for Small Object Detection

Object Detection in Remote Sensing Images Based on Improved Bounding Box Regression and Multi-Level Features Fusion

Improved Oriented Object Detection in Remote Sensing Images Based on a Three-Point Regression Method

Deconv R-Cnn For Small Object Detection On Remote Sensing Images

Detection Selection Algorithm: A Likelihood based Optimization Method to Perform Post Processing for Object Detection

Improved Region Proposal Network for Enhanced Few-Shot Object Detection

Not All Boxes Are Equal: Learning to Optimize Bounding Boxes With Discriminative Distributions in Optical Remote Sensing Images

Arbitrary-angle bounding box based location for object detection in remote sensing image

Improved Object Detection Algorithm Based on Faster RCNN

Object Detection and Instance Segmentation in Remote Sensing Imagery Based on Precise Mask R-CNN

RecursiveDet: End-to-End Region-based Recursive Object Detection

Point-Based Weakly Supervised Learning for Object Detection in High Spatial Resolution Remote Sensing Images

A Multi-Scale Target Detection Method Using an Improved Faster Region Convolutional Neural Network Based on Enhanced Backbone and Optimized Mechanisms

On Improving Bounding Box Representations for Oriented Object Detection

Semi-Supervised SAR Target Detection Based on an Improved Faster R-CNN.

High Quality Object Detection for Multiresolution Remote Sensing Imagery Using Cascaded Multi-Stage Detectors.

An object detection method for catenary component images based on improved Faster R-CNN