Adversarial Distortion for Learned Video Compression

Vijay Veerabadran, Reza Pourreza, Amirhossein Habibian, Taco Cohen

DOI: https://doi.org/10.48550/arXiv.2004.09508

2021-06-19

Abstract: In this paper, we present a novel adversarial lossy video compression model. At extremely low bit-rates, standard video coding schemes suffer from unpleasant reconstruction artifacts such as blocking and ringing. Existing learned neural approaches to video compression have achieved reasonable success in reducing the bit-rate for efficient transmission and reduce the impact of artifacts to an extent. However, they still tend to produce blurred results under extreme compression. In this paper, we present a deep adversarial learned video compression model that minimizes an auxiliary adversarial distortion objective. We find this adversarial objective to correlate better with human perceptual quality judgement relative to traditional quality metrics such as MS-SSIM and PSNR. Our experiments using a state-of-the-art learned video compression system demonstrate a reduction of perceptual artifacts and reconstruction of detail lost, especially under extremely high compression.

Image and Video Processing, Computer Vision and Pattern Recognition, Machine Learning

What problem does this paper attempt to address?

This paper addresses the unpleasant reconstruction artifacts (such as blocking and ringing) that appear in video compression at very low bit rates. Existing neural-network-based learned video compression methods have had some success in reducing the bit rate for efficient transmission and have lessened the impact of artifacts to a degree, but they still tend to produce blurry results under extreme compression. The paper therefore proposes a video compression model trained with an adversarial loss, which improves perceptual quality by minimizing an auxiliary adversarial distortion objective. Experiments show that the model reduces perceptual artifacts and restores detail under extremely high compression. The main contributions of the paper are:

1. Proposing the use of an adversarial loss to improve the perceptual quality of learned video compression.
2. Studying techniques that stabilize training with an adversarial loss.
3. Studying a spatio-temporally decomposed discriminator that supports end-to-end training of deep video compression networks.

Together, these improvements aim to raise the visual quality of compressed video while keeping the bit rate low, especially under extreme compression.
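As a rough illustration of what "minimizing an auxiliary adversarial distortion objective" can look like, the sketch below combines a rate term, a conventional distortion term (MSE), and an adversarial term into one training loss. Everything specific here is an assumption made for illustration, not taken from the paper: the non-saturating GAN loss, the weights `lam` and `beta`, and the function names are all hypothetical, and the discriminator is represented only by the logits it would output.

```python
import math

def softplus(x: float) -> float:
    # Numerically stable log(1 + exp(x)).
    return max(x, 0.0) + math.log1p(math.exp(-abs(x)))

def mse(original, reconstruction):
    # Conventional pixel-wise distortion between two equal-length sequences.
    return sum((a - b) ** 2 for a, b in zip(original, reconstruction)) / len(original)

def generator_adversarial_loss(fake_logits):
    # Assumed non-saturating GAN loss for the codec: -log(sigmoid(D(x_hat))).
    # It is small when the discriminator rates reconstructions as "real".
    return sum(softplus(-z) for z in fake_logits) / len(fake_logits)

def discriminator_loss(real_logits, fake_logits):
    # Standard binary cross-entropy on logits: originals are labelled real,
    # reconstructions are labelled fake.
    real = sum(softplus(-z) for z in real_logits) / len(real_logits)
    fake = sum(softplus(z) for z in fake_logits) / len(fake_logits)
    return real + fake

def compression_loss(rate_bits, original, reconstruction, fake_logits,
                     lam=0.01, beta=0.1):
    # Hypothetical combined objective: rate + lam * distortion + beta * adversarial.
    # lam and beta are illustrative placeholders, not values from the paper.
    return (rate_bits
            + lam * mse(original, reconstruction)
            + beta * generator_adversarial_loss(fake_logits))

if __name__ == "__main__":
    frame = [0.2, 0.5, 0.9]
    decoded = [0.25, 0.45, 0.8]
    print(compression_loss(120.0, frame, decoded, fake_logits=[-1.0, 0.3]))
```

In a setup like this, the codec and the discriminator would be updated in alternation: the discriminator minimizes `discriminator_loss`, while the codec minimizes `compression_loss`, so the adversarial term pushes reconstructions toward outputs the discriminator cannot tell apart from original frames.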

Learned Low Bit-rate Image Compression with Adversarial Mechanism

Video Compression With Rate-Distortion Autoencoders

An Imperceptible Adversarial Attack Against Reconstruction for Learned Image Compression

High Visual-Fidelity Learned Video Compression

Multi-rate Adaptive Transform Coding for Video Compression

RoVISQ: Reduction of Video Service Quality via Adversarial Attacks on Deep Learning-based Video Compression

Learned Video Compression with Adaptive Temporal Prior and Decoded Motion-aided Quality Enhancement

Towards Robust Neural Image Compression: Adversarial Attack and Model Finetuning

Leveraging Bitstream Metadata for Fast, Accurate, Generalized Compressed Video Quality Enhancement

Feedback Recurrent Autoencoder for Video Compression

Toward Robust Neural Image Compression: Adversarial Attack and Model Finetuning

On the Adversarial Robustness of Learning-based Image Compression Against Rate-Distortion Attacks

MALICE: Manipulation Attacks on Learned Image ComprEssion

On the Choice of Perception Loss Function for Learned Video Compression

NetFlick: Adversarial Flickering Attacks on Deep Learning Based Video Compression

Research on video adversarial attack with long living cycle

Slimmable Video Codec

On the Robustness of Neural-Enhanced Video Streaming against Adversarial Attacks

Versatile Learned Video Compression

Object-aware Image Compression with Adversarial Learning