Abstract:Deep learning methods in computer vision have shown tremendous progress recently. Most image restoration methods are based on single feature extraction. However, image restoration requires various types of information to reconstruct the properties such as structure, color, texture details. Low-quality pictures are widespread in the real world because of the natural conditions and unstable photography. Therefore, we propose a double feature fusion network (DFFNet) to overcome this challenge in image inpainting. DFFNet extracts deblurring and structure features simultaneously and adopts multipath refinement framework aided by multipath contextual attention modules to restore the mask region in a coarse-to-fine manner. Thus, our work can restore blurred and large masked images into sharp and complete images and neglect the meaningless occlusions. It outperforms the state-of-the-art combined methods for sharper and smarter inpainting. First, we design a multi-path refinement network that can extract multi-features for further refinement and coarse inpainting. Second, multipath contextual attention modules are reconstructed for receiving edge and deblur features and finer inpainting results. Third, multi-stage synthesis loss function and double feature fusion units ensure that the restored images have the same structure and texture similarity as the original ground-truth counterparts. Therefore, the progressive learning framework DFFNet adopts key elements of image restoration information such as structure and texture, and refines the background for removing meaningless noise for sharper and smarter inpainting.

Progressive Temporal Feature Alignment Network for Video Inpainting

Temporal Adaptive Alignment Network for Deep Video Inpainting.

Temporal Group Fusion Network for Deep Video Inpainting

Structure-Guided Deep Video Inpainting

Align-and-Attend Network for Globally and Locally Coherent Video Inpainting

Learning Joint Spatial-Temporal Transformations for Video Inpainting

Recurrent Temporal Aggregation Framework for Deep Video Inpainting

A Temporally-Aware Interpolation Network for Video Frame Inpainting

FSTT: Flow-Guided Spatial Temporal Transformer for Deep Video Inpainting

Video Inpainting by Jointly Learning Temporal Structure and Spatial Details

DANet: Deformable Alignment Network for Video Inpainting

Frame-Recurrent Video Inpainting by Robust Optical Flow Inference

Deep Transformer Based Video Inpainting Using Fast Fourier Tokenization

A Double Feature Fusion Network with Progressive Learning for Sharper Inpainting

WTVI: A Wavelet-Based Transformer Network for Video Inpainting

VORNet: Spatio-temporally Consistent Video Inpainting for Object Removal

Frequency-Aware Spatiotemporal Transformers for Video Inpainting Detection

Spatial-Temporal Residual Aggregation for High Resolution Video Inpainting

Learnable Gated Temporal Shift Module for Deep Video Inpainting

3DPF-FBN: Video Inpainting by Jointly 3D-Patch Filling and Neural Network Refinement

ProPainter: Improving Propagation and Transformer for Video Inpainting