Abstract:Shadow removal and segmentation remain challenging tasks in computer vision, particularly in complex real-world scenarios. This study presents a novel approach that enhances the ShadowFormer model by incorporating Masked Autoencoder (MAE) priors and Fast Fourier Convolution (FFC) blocks, leading to significantly faster convergence and improved performance. We introduce key innovations: (1) integration of MAE priors trained on Places2 dataset for better context understanding, (2) adoption of Haar wavelet features for enhanced edge detection and multi-scale analysis, and (3) implementation of a modified SAM Adapter for robust shadow segmentation. Extensive experiments on the challenging DESOBA dataset demonstrate that our approach achieves state-of-the-art results, with notable improvements in both convergence speed and shadow removal quality.

What problem does this paper attempt to address?

The problem that this paper attempts to solve is the challenges of shadow segmentation and removal in the field of computer vision, especially in complex real - world scenes. Specifically: 1. **Difficulties in shadow segmentation and removal**: In complex real - world environments, there are complex interactions between shadows and objects, which makes accurate shadow segmentation and removal very difficult. 2. **Limitations of existing methods**: - Traditional methods rely on physical models and heuristic rules based on chromaticity, intensity, and texture, and these methods have limited effectiveness when dealing with complex scenes. - Although machine - learning methods have improved by combining hand - designed features (such as texture, color, edges) with classifiers (such as SVM and decision trees), they are still insufficient in comprehensively characterizing shadows. - Deep - learning methods have significantly improved performance through automatic feature extraction, but they still have problems such as slow convergence speed and poor performance when dealing with complex scenes. 3. **The need to improve performance**: In order to achieve faster convergence speed and higher shadow - removal quality in complex real - world scenes, new techniques and methods need to be introduced to enhance the performance of existing models. For this purpose, the paper proposes a novel method to improve shadow segmentation and removal by integrating the following key techniques: - **Masked Autoencoder (MAE) prior**: Use the MAE trained on the Places2 dataset to provide better context understanding. - **Haar wavelet features**: Used to enhance edge detection and multi - scale analysis and improve the accuracy of shadow segmentation. - **Improved SAM Adapter**: Improve the effect of shadow segmentation by generating more robust shadow masks. - **Fast Fourier Convolution (FFC) block**: Used to better capture global context information and accelerate model convergence. Through these innovations, the paper aims to significantly improve the performance of shadow segmentation and removal, especially in complex real - world scenes. Experimental results show that this method has achieved state - of - the - art results on the DESOBA dataset, showing faster convergence speed and higher shadow - removal quality.

WavShadow: Wavelet Based Shadow Segmentation and Removal

SoftShadow: Leveraging Penumbra-Aware Soft Masks for Shadow Removal

Detect Any Shadow: Segment Anything for Video Shadow Detection

ShadowRefiner: Towards Mask-free Shadow Removal via Fast Fourier Transformer

ShadowMamba: State-Space Model with Boundary-Region Selective Scan for Shadow Removal

Robust Shadow Detection by Exploring Effective Shadow Contexts

Shadow-Aware Dynamic Convolution for Shadow Removal.

Automatic Shadow Detection and Removal from a Single Image

SpA-Former: Transformer image shadow detection and removal via spatial attention

SAM-helps-Shadow:When Segment Anything Model meet shadow removal

Fine-Context Shadow Detection using Shadow Removal

S2Net: Shadow Mask-Based Semantic-Aware Network for Single-Image Shadow Removal

Predicted Virtual Soft Shadow Maps with High Quality Filtering

Exponential Soft Shadow Mapping

ShadowMaskFormer: Mask Augmented Patch Embeddings for Shadow Removal

ShadowRemovalNet: Efficient Real-Time Shadow Removal

Shadow Removal Refinement via Material-Consistent Shadow Edges

Shadow Removal via Shadow Image Decomposition

From Shadow Segmentation to Shadow Removal

Efficient Model-Driven Network for Shadow Removal

ShaDocFormer: A Shadow-Attentive Threshold Detector With Cascaded Fusion Refiner for Document Shadow Removal