Abstract:A challenging task is to make sure that the deep learning network learns prediction accuracy by itself. Intersection-over-Union (IoU) amidst ground truth and instance mask determines mask quality. There is no relationship between classification score and mask quality. The mission is to investigate this problem and learn the predicted instance mask’s accuracy. The proposed network regresses the MaskIoU by comparing the predicted mask and the respective instance feature. The mask scoring strategy determines the disorder among mask score and mask quality, then adjusts the parameters accordingly. Adaptation ability to the object’s geometric variations decides deformable convolutional network’s performance. Using increased modeling power and stronger training, focusing ability on pertinent image regions is improved by a reformulated Deformable ConvNets. The introduction of modulation technique, which broadens the deformation modeling scope, and the integration of deformable convolution comprehensively within the network enhance the modeling power. The features which resemble region-based convolutional neural network (R-CNN) feature’s classification capability and its object focus are learned by the network with the help of feature mimicking scheme of DCNv2. Feature mimicking scheme of DCNv2 guides the network training to efficiently control this enhanced modeling capability. The backbone of the proposed Mask Scoring R-CNN network is designed with ResNet-152 FPN and DCNv2 network. The proposed Mask Scoring R-CNN network with DCNv2 network is also tested with other backbones ResNet-50 and ResNet-101. Instance segmentation and object detection on COCO benchmark and Cityscapes dataset are achieved with top accuracy and improved performance using the proposed network.

Curvature-Driven Deformable Convolutional Networks for End-To-End Object Detection

An Efficient Compressive Convolutional Network for Unified Object Detection and Image Compression

Adaptive deformable convolutional network

Improving object detection with deep convolutional networks via Bayesian optimization and structured prediction

Deformable ConvNet with Aspect Ratio Constrained NMS for Object Detection in Remote Sensing Imagery

An improved object detection algorithm based on multi-scaled and deformable convolutional neural networks

Object Detection in VHR Image Using Transfer Learning with Deformable Convolution

DCNet: A Deformable Convolutional Cloud Detection Network for Remote Sensing Imagery

DIGCN: A Dynamic Interaction Graph Convolutional Network Based on Learnable Proposals for Object Detection

DeepID-Net: Deformable Deep Convolutional Neural Networks for Object Detection

Contour deformation network for instance segmentation

Deep Convolutional Feature Enhancement for Remote Sensing Object Detection

Deformable Capsules for Object Detection

Untangling Local and Global Deformations in Deep Convolutional Networks for Image Classification and Sliding Window Detection

Entire Deformable ConvNets for semantic segmentation

DeepID-Net: multi-stage and deformable deep convolutional neural networks for object detection

Fusing DCN and BBAV for Remote Sensing Image Object Detection

Object Contour Detection with a Fully Convolutional Encoder-Decoder Network

Deep-Learning-Based Object-Level Contour Detection With Ccg And Crf Optimization

Improvement of Bounding Box and Instance Segmentation Accuracy Using ResNet-152 FPN with Modulated Deformable ConvNets v2 Backbone-based Mask Scoring R-CNN

S-CNN: Subcategory-Aware Convolutional Networks for Object Detection