Abstract:Natural image matting focuses on accurately estimating the opacity of the foreground object in an arbitrary background. Recently, deep learning-based approaches made significant progress in the matting task benefit from their powerful learning ability for semantic features. However, artifacts, blurry structures, and miscalculated pixels still often appear in some difficult regions with background interference and complex details. To address the above issues, we propose a cross-layer contextual information propagation mechanism (CCIP) that can explicitly model the long-range correlations between global and unknown regions. Specifically, we first calculate region affinity at high-level features with rich structure and semantic information; then reconstruct the adjacent low-level features by propagating information from the global region to the unknown region under the guidance of the affinity matrix; finally, transfer the reconstructed information to the corresponding decoder stage to further improve the feature distinctiveness. In addition, we design a simple and effective supervision strategy in a deep-to-shallow manner to gradually optimize the edges and details of the foreground object. We conducted extensive experiments on the common dataset Composition-1k, the alphamatting.com benchmark, and some real-world images. Compared with previous methods, the proposed method achieves competitive performance on the Composition-1k dataset (30.3 on SAD, 6.8 on MSE, 13.3 on Grad, and 26.7 on Con) and alphamatting.com benchmark (17 on average SAD rank and 16.8 on average Grad rank), while simultaneously yielding high-quality matting results on real-world images.

Natural Image Matting with Shifted Window Self-Attention.

Effective Local-Global Transformer for Natural Image Matting

TransMatting: Enhancing Transparent Objects Matting with Transformers

Highly Efficient Natural Image Matting

Natural Image Matting with Attended Global Context

Multi-Task Affinity Propagation Based Natural Image Matting

From Composited to Real-world: Transformer-based Natural Image Matting

High-Resolution Deep Image Matting

Context-Aware Image Matting for Simultaneous Foreground and Alpha Estimation

Natural Image Matting via Guided Contextual Attention

MAT: Mask-Aware Transformer for Large Hole Image Inpainting

A Late Fusion CNN for Digital Matting

ViTMatte: Boosting Image Matting with Pretrained Plain Vision Transformers

Deep Image Matting: A Comprehensive Survey

Deep image matting with cross-layer contextual information propagation

PP-Matting: High-Accuracy Natural Image Matting

Memory Efficient Matting with Adaptive Token Routing

VMFormer: End-to-End Video Matting with Transformer.

Automatic Framework for Highly Efficient Natural Image Matting

Morpho-Aware Global Attention for Image Matting

Towards Enhancing Fine-grained Details for Image Matting