Abstract:Weakly supervised object detection (WSOD) has attracted attention increasingly in object detection, as it only requires image-level annotations to train the detector. A typical paradigm for WSOD is to first generate candidate region proposals for the training data, and then each image is treated as a bag of proposals to conduct the training based on the multiple instance learning (MIL). Most methods focus on optimizing the training process, but rarely consider the influence of pre-generated proposals that directly affect the learning of the detector, due to the overwhelming noisy proposals (e.g., negative or background proposals) and positive proposals with inaccurate locations. In this paper, we focus on improving the quality of proposals, and propose a recurrent self-optimizing proposal framework, a new paradigm for WSOD, to iteratively optimize the pre-generated proposals. In each iteration, all detection results (i.e., the object-aware coordinate offsets and the confidence scores) are accumulated for proposal optimization. To achieve accurate object location, we design a proposal self-transformation module to transform the locations of pre-generated proposals based on the coordinate offsets. To alleviate the impact of noise proposals, we design a proposal self-sampling module to mine object instances through confidence scores to filter out noisy proposals. Furthermore, these optimized proposals are fed into a decoupled proposal learner, which contains two parallel proposal training branches. A MIL module and an instance refinement module are supervised by the image label and the mined object instances, respectively. In addition, the instance refinement module contains an instance regression refinement module, which is proposed to generate object-aware coordinate offsets. In turn, the decoupled proposal learner produces the new detection results to optimize proposals in the next iteration. Extensive experiments on PASCAL VOC and MS-COCO datasets demonstrate the effectiveness of our method.

Recurrent Self-Optimizing Proposals for Weakly Supervised Object Detection

Spatial Likelihood Voting with Self-Knowledge Distillation for Weakly Supervised Object Detection.

SLV: Spatial Likelihood Voting for Weakly Supervised Object Detection

High-Quality Proposals for Weakly Supervised Object Detection.

Self-Guided Proposal Generation for Weakly Supervised Object Detection

HUWSOD: Holistic Self-training for Unified Weakly Supervised Object Detection

Hierarchical Region Proposal Refinement Network for Weakly Supervised Object Detection

PCL: Proposal Cluster Learning for Weakly Supervised Object Detection

Cyclic Self-Training With Proposal Weight Modulation for Cross-Supervised Object Detection

A Dual-Network Progressive Approach to Weakly Supervised Object Detection.

Self-Training-Based Semantic-Balanced Network for Weakly Supervised Object Detection in Remote-Sensing Images

Optimizing Region Selection for Weakly Supervised Object Detection

WSODPB: Weakly Supervised Object Detection with PCSNet and Box Regression Module.

Weakly Supervised Object Detection for Remote Sensing Images via Progressive Image-Level and Instance-Level Feature Refinement

Refining and reweighting pseudo labels for weakly supervised object detection

Contrastive Proposal Extension With LSTM Network for Weakly Supervised Object Detection

MOL: Towards Accurate Weakly Supervised Remote Sensing Object Detection Via Multi-view Noisy Learning

Weakly Supervised Open-Vocabulary Object Detection

Saliency Guided End-to-end Learning Forweakly Supervised Object Detection

Weakly Supervised Object Detection with Symmetry Context

Saliency Guided End-to-End Learning for Weakly Supervised Object Detection.