Abstract:Utilizing trimap guidance and fusing multi-level features are two important issues for trimap-based matting with pixel-level prediction. To utilize trimap guidance, most existing approaches simply concatenate trimaps and images together to feed a deep network or apply an extra network to extract more trimap guidance, which meets the conflict between efficiency and effectiveness. For emerging content-based feature fusion, most existing matting methods only focus on local features which lack the guidance of a global feature with strong semantic information related to the interesting object. In this paper, we propose a trimap-guided feature mining and fusion network consisting of our trimap-guided non-background multi-scale pooling (TMP) module and global-local context-aware fusion (GLF) modules. Considering that trimap provides strong semantic guidance, our TMP module focuses effective feature mining on interesting objects under the guidance of trimap without extra parameters. Furthermore, our GLF modules use global semantic information of interesting objects mined by our TMP module to guide an effective global-local context-aware multi-level feature fusion. In addition, we build a common interesting object matting (CIOM) dataset to advance high-quality image matting. Particularly, results on the Composition-1k and our CIOM show that our TMFNet achieves 13% and 25% relative improvement on SAD, respectively, against a strong baseline with fewer parameters and 14% fewer FLOPs. Experimental results on the Composition-1k test set, Alphamatting benchmark, and our CIOM test set demonstrate that our method outperforms state-of-the-art approaches. Our code and models are available at <a class="link-external link-https" href="https://github.com/Serge-weihao/TMF-Matting" rel="external noopener nofollow">this https URL</a>.

TransMatting: Enhancing Transparent Objects Matting with Transformers

Natural Image Matting with Shifted Window Self-Attention.

A Late Fusion CNN for Digital Matting

MatteFormer: Transformer-Based Image Matting via Prior-Tokens

Enhancing Transparent Object Matting Using Predicted Definite Foreground and Background

Highly Efficient Natural Image Matting

Boosting General Trimap-free Matting in the Real-World Image

Attention-guided Temporally Coherent Video Object Matting

Disentangled Image Matting

Effective Local-Global Transformer for Natural Image Matting

To-Former: semantic segmentation of transparent object with edge-enhanced transformer

From Composited to Real-world: Transformer-based Natural Image Matting

Memory Efficient Matting with Adaptive Token Routing

Text-Guided Portrait Image Matting

Trimap-guided Feature Mining and Fusion Network for Natural Image Matting

ViTMatte: Boosting Image Matting with Pretrained Plain Vision Transformers

Matte Anything: Interactive Natural Image Matting with Segment Anything Models

Semantic-guided Automatic Natural Image Matting with Trimap Generation Network and Light-weight Non-local Attention

Semantic Image Matting

Matte anything: Interactive natural image matting with segment anything model

Segmenting Transparent Object in the Wild with Transformer