Abstract:The demands of high-quality videos captured by camera become bigger due to the rapid development of pattern recognition and artificial intelligence. Video denoising is the key technology to obtain clear videos. However, the research on video denoising is far from enough now. In this paper, we propose a video denoising method based on convolutional neural network architecture to reduce the noise from the sensor system. We improve the loss function of noise estimation by imposing adaptive penalty on under-estimation error of noise level which makes our method perform robustly. Furthermore, we make use of multi-level features to guide the spatial denoising, where multilayer semantic information of the image is regarded as the perceptual loss. Instead of relying on Optical Flow solving the characterization of inter-frame information, we utilize U-Net-like structure to handle motion implicitly. It is less computationally expensive and avoids distortions caused by inaccurate flow and object occlusion. In order to locate temporal features and suppress useless information, the attention mechanism is introduced to the skip connections of the U-Net-like structure. Experimental results demonstrate that the proposed algorithm outputs more convincing results in both peak signal-to-noise ratio (PSNR) and structural similarity index measure (SSIM) indexes when processing Gaussian noise, synthetic real noise, and real noise compared with selected approaches.

Coarse-to-Fine Video Denoising with Dual-Stage Spatial-Channel Transformer

Spatial-Adaptive Network for Single Image Denoising

Image Denoising Via Multi-Scale Gated Fusion Network

A Dynamic Network with Transformer for Image Denoising

Monte Carlo Denoising Via Multi-scale Auxiliary Feature Fusion Guided Transformer.

DDT: Dual-branch Deformable Transformer for Image Denoising

Spatio-Temporal Video Denoising Based on Attention Mechanism

Hybrid Transformer-CNN for Real Image Denoising

Low-Light Raw Video Denoising with a High-Quality Realistic Motion Dataset

First image then video: A two-stage network for spatiotemporal video denoising

Temporal As a Plugin: Unsupervised Video Denoising with Pre-Trained Image Denoisers

Unsupervised Coordinate-Based Video Denoising

A cross Transformer for image denoising

A Practical Gated Recurrent Transformer Network Incorporating Multiple Fusions for Video Denoising

Hybrid Spatial-spectral Neural Network for Hyperspectral Image Denoising

Single image denoising with a feature-enhanced network

Image Denoising Using Channel Attention Residual Enhanced Swin Transformer

Channel and Space Attention Neural Network for Image Denoising

Two-stage Progressive Residual Dense Attention Network for Image Denoising

TDNet: transformer-based network for point cloud denoising.

Gated Recurrent Unit for Video Denoising