Abstract: An instance mining approach is proposed to tackle incomplete region localization in image-level weakly supervised instance segmentation by discovering more unseen instances and more complete object regions. Noisy mask prediction is tackled with an instance filtering approach that integrates class confidence to obtain cleaner, more reliable instance masks. On the PASCAL VOC 2012 and COCO datasets, an implementation of the model with popular DCNNs, e.g. ResNet50, substantially improves performance on this task.

Learning the full extent of pixel-level instance response in a weakly supervised manner remains unsatisfactory. Peak response maps (PRMs) localize the discriminative object regions but cannot provide complete instance information, and so suffer from incomplete segmentation and from unreliable mask prediction caused by noisy proposal retrieval. This work tackles this problem by mining diverse class peak responses that cover more discriminative and complete object regions, and by retrieving more reliable proposals from noisy segment proposal galleries. First, the existing method is enhanced with two more classification branches, which contribute more diverse and abundant instance regions from peak response maps. The class peak responses mined from two of the branches are then merged by a clustering approach in their deep feature space to generate more complete peak response maps. Then, instance segmentation masks are retrieved from a noisy object segment proposal gallery using class confidence, which is calculated by a standard classifier, to obtain cleaner mask predictions. Finally, the resulting pseudo-supervision can be used to train an instance segmentation network in a fully supervised manner.
Experiments on the PASCAL VOC 2012 and COCO datasets show that the approach works effectively and outperforms its counterparts by margins of more than 6%, 4%, and 3% in mean average precision (mAP) at IoU thresholds of 0.25, 0.5, and 0.75, respectively.
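
The retrieval step described in the abstract, picking a mask from a noisy proposal gallery with the help of class confidence, can be illustrated with a minimal sketch. The abstract does not give the scoring rule, so everything below is an assumption made for illustration: the names `response_coverage` and `retrieve_proposal` are hypothetical, and the score (the fraction of peak response covered by a proposal, multiplied by that proposal's class confidence) is a stand-in, not the authors' formulation.

```python
from typing import List, Sequence, Tuple

Grid = Sequence[Sequence[float]]  # a 2D map stored as rows of numbers


def response_coverage(prm: Grid, mask: Grid) -> float:
    """Fraction of the total peak response that falls inside a binary mask."""
    total = sum(sum(row) for row in prm)
    if total == 0:
        return 0.0
    inside = sum(
        r * m
        for prm_row, mask_row in zip(prm, mask)
        for r, m in zip(prm_row, mask_row)
    )
    return inside / total


def retrieve_proposal(
    prm: Grid,
    proposals: Sequence[Grid],
    class_confidences: Sequence[float],
) -> Tuple[int, List[float]]:
    """Return the index of the best-scoring proposal and all scores.

    Assumed score (illustrative only): coverage * class confidence, where
    class_confidences[i] is a classifier's confidence that proposal i
    contains the target class.
    """
    if not proposals:
        raise ValueError("the proposal gallery is empty")
    if len(proposals) != len(class_confidences):
        raise ValueError("need exactly one confidence per proposal")
    scores = [
        response_coverage(prm, mask) * conf
        for mask, conf in zip(proposals, class_confidences)
    ]
    best = max(range(len(scores)), key=scores.__getitem__)
    return best, scores


if __name__ == "__main__":
    # Toy 4x4 peak response map with its response in the central 2x2 block.
    prm = [
        [0.0, 0.0, 0.0, 0.0],
        [0.0, 0.9, 0.8, 0.0],
        [0.0, 0.7, 0.6, 0.0],
        [0.0, 0.0, 0.0, 0.0],
    ]
    tight = [[0, 0, 0, 0], [0, 1, 1, 0], [0, 1, 1, 0], [0, 0, 0, 0]]
    whole_image = [[1] * 4 for _ in range(4)]
    left_half = [[1, 1, 0, 0] for _ in range(4)]
    best, scores = retrieve_proposal(
        prm, [tight, whole_image, left_half], [0.9, 0.3, 0.8]
    )
    print(best, scores)
```

In this toy gallery the tight mask and the whole-image mask both cover all of the peak response, so coverage alone cannot separate them. The confidence term is what demotes the oversized proposal, which is the kind of filtering role the abstract attributes to class confidence.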

Weakly-supervised Instance Segmentation via Class-agnostic Learning with Salient Images

Weakly Supervised Instance Segmentation Using Multi-Prior Fusion.

Weakly Supervised Instance Segmentation by Exploring Entire Object Regions

Box2Mask: Weakly Supervised 3D Semantic Instance Segmentation Using Bounding Boxes

Box2Mask: Box-supervised Instance Segmentation via Level-set Evolution

Weakly Supervised Semantic Segmentation via Box-Driven Masking and Filling Rate Shifting

Box-supervised Instance Segmentation with Level Set Evolution

Deep Level Set for Box-supervised Instance Segmentation in Aerial Images

Weakly Supervised Instance Segmentation by Deep Community Learning

Box-driven Class-wise Region Masking and Filling Rate Guided Loss for Weakly Supervised Semantic Segmentation

Toward High Quality Multi-Object Tracking and Segmentation Without Mask Supervision

Weakly-Supervised Concealed Object Segmentation with SAM-based Pseudo Labeling and Multi-scale Feature Grouping

BoxInst: High-Performance Instance Segmentation with Box Annotations.

Solve the Puzzle of Instance Segmentation in Videos: A Weakly Supervised Framework With Spatio-Temporal Collaboration

Where are the Masks: Instance Segmentation with Image-level Supervision

Weakly supervised instance segmentation via peak mining and filtering

Weakly Supervised Semantic Segmentation Based on Co-segmentation.

Boosting Box-supervised Instance Segmentation with Pseudo Depth

Weakly Supervised 3D Instance Segmentation without Instance-level Annotations