Abstract:Natural image matting separates the foreground from background in fractional occupancy which can be caused by highly transparent objects, complex foreground (e.g., net or tree), and/or objects containing very fine details (e.g., hairs). Although conventional matting formulation can be applied to all of the above cases, no previous work has attempted to reason the underlying causes of matting due to various foreground semantics. We show how to obtain better alpha mattes by incorporating into our framework semantic classification of matting regions. Specifically, we consider and learn 20 classes of matting patterns, and propose to extend the conventional trimap to semantic trimap. The proposed semantic trimap can be obtained automatically through patch structure analysis within trimap regions. Meanwhile, we learn a multi-class discriminator to regularize the alpha prediction at semantic level, and content-sensitive weights to balance different regularization losses. Experiments on multiple benchmarks show that our method outperforms other methods and has achieved the most competitive state-of-the-art performance. Finally, we contribute a large-scale Semantic Image Matting Dataset with careful consideration of data balancing across different semantic classes.

What problem does this paper attempt to address?

This paper attempts to solve several key problems in Natural Image Matting: 1. **Lack of semantic information**: Traditional matting methods mainly rely on low - level image features, such as color or structure, without considering the semantic information of foreground objects. This leads to the failure of traditional methods when dealing with complex foregrounds (such as hair, meshes, trees, etc.). 2. **Boundary and pattern diversity of different foreground objects**: Different types of boundaries or patterns may exist around different foreground objects. For example, a portrait usually has both a fuzzy hair area and a clear body boundary. Existing methods perform poorly when dealing with these diverse boundaries and patterns. 3. **Limitations of the traditional Trimap**: The traditional Trimap only contains information about the foreground, background, and unknown regions, lacking semantic classification of pixels within the unknown regions, which limits the performance of the model in complex scenarios. To solve the above problems, the paper proposes a new framework to improve the effect of natural image matting by introducing **semantic classification**. Specifically, the main contributions of the paper include: 1. **Introducing semantic information**: Semantic classification is introduced for the first time in the natural image matting task, showing how to use semantic information to achieve the most competitive performance. 2. **Expanding the traditional Trimap**: The **Semantic Trimap** is proposed, which contains multi - channel confidence maps for each unknown region to represent the confidence of different matting categories. 3. **Multi - class discriminator**: A multi - class discriminator is introduced to provide supervision at the semantic level to regularize the alpha prediction results and introduce content - sensitive weights to balance different regularization losses. 4. **Constructing a large - scale dataset**: Contribute the first large - scale and class - balanced **Semantic Image Matting Dataset**, covering a wide range of matting patterns, providing new perspectives and in - depth analysis for future matting research. Through these innovations, the method proposed in the paper has achieved state - of - the - art performance in multiple benchmark tests, especially when dealing with complex boundaries and patterns of different foreground objects.

Semantic Image Matting

Salient Image Matting

Semantic-guided Automatic Natural Image Matting with Trimap Generation Network and Light-weight Non-local Attention

Semantic Human Matting

Disentangled Image Matting

PP-Matting: High-Accuracy Natural Image Matting

Highly Efficient Natural Image Matting

Towards Natural Image Matting in the Wild via Real-Scenario Prior

Weakly Supervised Image Matting Via Patch Clustering

Multi-guided-based image matting via boundary detection

Matte Anything: Interactive Natural Image Matting with Segment Anything Models

Coarse Semantic Guided Alpha Matting Via Simultaneous Foreground and Background Estimation

Matte anything: Interactive natural image matting with segment anything model

Attention-guided Temporally Coherent Video Object Matting

Boosting General Trimap-free Matting in the Real-World Image

Deep Image Matting with Sparse User Interactions.

Robust Human Matting via Semantic Guidance

Matting Anything

Deep Automatic Natural Image Matting

Confidence-driven Image Co-Matting.

Boosting Semantic Human Matting with Coarse Annotations.