Abstract:To resolve the problem that the segmentation result of the full convolutional neural network in the Mask R-CNN model is not fine enough, and that the number of loss function hyperparameters is too large, leadings to the time and resource consumption of parameter adjustment, we propose a parameter link and efficient instance segmentation model in this paper. Aiming at the problem that the Mask R-CNN model does not pay attention to sample features, the method of fusing the visual attention network in the ResNet50 backbone network is adopted to achieve self-adaptation and long-range correlation in self-attention, so that the model can precisely recognize the target location and effectively detect and segment the target. The U-Net network is introduced into the segmentation, and the image is processed by stepwise upsampling and downsampling, so that the network segmentation accuracy for the pixel mask is more accurate. Considering the parameter tuning problem of the instance segmentation task, a parameter link loss is recommended to simplify the complexity of model training parameter tuning and further enhance the detection and segmentation performance of the model. We conduct extensive experiments on three extensive baselines, i.e., MiniCOCO, Cityscapes and PASCAL VOC2012, to assess the validity of our model. The experimental findings demonstrate that (1) in the MiniCOCO dataset, a box AP of 35.1 and a mask AP of 32.0 are obtained. Compared with the most advanced mask2former algorithm, the box AP and mask AP are 1.7 and 2.2 higher, respectively. (2) The AP value on Cityscapes is 38.1. In comparison with alternative instance segmentation models, the mAP of each category has been greatly improved. (3) The generalization experiment of our model on the PASCAL VOC2012 dataset shows that the box mAP and mask mAP are 75.5 and 63.6, respectively, which are improved by 3.9 and 1.9, respectively, when contrasting with the Mask R-CNN model. Our model has significant advantages in both detection and segmentation. The code will be available at https://gitee.com/zhiweilu111/simple-mask/tree/master.

Parameter-Efficient Masking Networks

Masked Autoencoders for Point Cloud Self-supervised Learning.

One Network, Many Masks: Towards More Parameter-Efficient Transfer Learning

Mask-Pyramid Network: A Novel Panoptic Segmentation Method

MPDCompress - Matrix Permutation Decomposition Algorithm for Deep Neural Network Compression

SimpleMask: parameter link and efficient instance segmentation

Efficient Masked Autoencoders with Self-Consistency

Triple Point Masking

Towards Compact 3D Representations via Point Feature Enhancement Masked Autoencoders

You Can Mask More For Extremely Low-Bitrate Image Compression

Effective Sparsification of Neural Networks with Global Sparsity Constraint

Masked Autoencoders are Parameter-Efficient Federated Continual Learners

Pre-training Point Cloud Compact Model with Partial-aware Reconstruction

HyperMask: Adaptive Hypernetwork-based Masks for Continual Learning

Learning Compact Representations of Neural Networks using DiscriminAtive Masking (DAM)

Mask More and Mask Later: Efficient Pre-training of Masked Language Models by Disentangling the [MASK] Token

Exploring Effective Mask Sampling Modeling for Neural Image Compression

PolyMaX: General Dense Prediction with Mask Transformer

Random Masking Finds Winning Tickets for Parameter Efficient Fine-tuning

Toward Compact Parameter Representations for Architecture-Agnostic Neural Network Compression

MLAE: Masked LoRA Experts for Visual Parameter-Efficient Fine-Tuning