Abstract:Deep learning is currently the mainstream method of object detection. Faster region-based convolutional neural network (Faster R-CNN) has a pivotal position in deep learning. It has impressive detection effects in ordinary scenes. However, under special conditions, there can still be unsatisfactory detection performance, such as the object having problems like occlusion, deformation, or small size. This paper proposes a novel and improved algorithm based on the Faster R-CNN framework combined with the Faster R-CNN algorithm with skip pooling and fusion of contextual information. This algorithm can improve the detection performance under special conditions on the basis of Faster R-CNN. The improvement mainly has three parts: The first part adds a context information feature extraction model after the conv5_3 of the convolutional layer; the second part adds skip pooling so that the former can fully obtain the contextual information of the object, especially for situations where the object is occluded and deformed; and the third part replaces the region proposal network (RPN) with a more efficient guided anchor RPN (GA-RPN), which can maintain the recall rate while improving the detection performance. The latter can obtain more detailed information from different feature layers of the deep neural network algorithm, and is especially aimed at scenes with small objects. Compared with Faster R-CNN, you only look once series (such as: YOLOv3), single shot detector (such as: SSD512), and other object detection algorithms, the algorithm proposed in this paper has an average improvement of 6.857% on the mean average precision (mAP) evaluation index while maintaining a certain recall rate. This strongly proves that the proposed method has higher detection rate and detection efficiency in this case.

Pay Attention to Them: Deep Reinforcement Learning-Based Cascade Object Detection.

Improving object detection with deep convolutional networks via Bayesian optimization and structured prediction

EBiDA-FPN: Enhanced Bi-Directional Attention Feature Pyramid Network for Object Detection

Single-Shot Bidirectional Pyramid Networks for High-Quality Object Detection.

Object Detection Based on Faster R-CNN Algorithm with Skip Pooling and Fusion of Contextual Information

High Quality Object Detection for Multiresolution Remote Sensing Imagery Using Cascaded Multi-Stage Detectors.

Cascaded Convolutional Neural Networks for Object Detection.

Multi-scale Object Detection by Top-Down and Bottom-Up Feature Pyramid Network

CA2Det: Cascaded Adaptive Fusion Pyramid Network Based on Attention Mechanism for Small Object Detection

Cascade R-CNN: Delving into High Quality Object Detection

Chained Cascade Network For Object Detection

Dual Attention Based Image Pyramid Network for Object Detection.

Reinforcedet - Object Detection by Integrating Reinforcement Learning with Decoupled Pipeline.

Cascaded Detection Framework Based on a Novel Backbone Network and Feature Fusion

Object Detection Algorithm Based on Channel-Spatial Fusion Cascade Attention

Action-Driven Object Detection with Top-Down Visual Attentions

Cascade Meta-RCNN for Few-shot Object Detection

C-RPNs: Promoting object detection in real world via a cascade structure of Region Proposal Networks.

Object detection based on an adaptive attention mechanism

Attention-Enhanced and More Balanced R-CNN for Object Detection

Object Detection with Class Aware Region Proposal Network and Focused Attention Objective