Abstract: In this paper, an effective low bit-rate video coding scheme is developed to achieve state-of-the-art video coding efficiency with lower encoder complexity, while supporting standard compliance and error resilience. Such an architecture is particularly attractive for application scenarios involving resource-deficient wireless video communications. At the encoder, to increase resilience to channel errors, multiple descriptions of a video sequence are generated in the spatio-temporal domain by temporal multiplexing and spatial adaptive downsampling. The resulting side descriptions are interleaved with each other in the temporal domain, while retaining conventional square sample grids in the spatial domain. As such, each side description can be compressed without any change to existing video coding standards. At the decoder, each side description is first decompressed and then reconstructed to the original resolution with the help of the other side description. In this procedure, the decoder recovers the original video sequence through a constrained least squares regression process, in which a 2D or 3D piecewise autoregressive model is adaptively chosen according to the predictive mode. In this way, the spatial and temporal correlations are fully exploited to achieve superior quality. Experimental results demonstrate that the proposed video coding scheme outperforms H.264/AVC and other state-of-the-art methods in rate-distortion performance at low bit rates and also achieves superior visual quality at medium bit rates, with lower encoder computational complexity.
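The encoder-side generation of side descriptions can be illustrated with a minimal sketch. It assumes two descriptions, an even/odd frame split for the temporal multiplexing, and plain 2x decimation in place of the adaptive downsampling, whose details the abstract does not give. The function names `split_descriptions`, `downsample_2x`, and `make_side_descriptions` are hypothetical and not taken from the paper.

```python
def split_descriptions(frames):
    """Temporal multiplexing (assumed even/odd split): frames alternate between two descriptions."""
    return frames[0::2], frames[1::2]


def downsample_2x(frame):
    """Keep every second row and column, so the output is still a regular square sample grid.

    Assumption: plain decimation stands in for the adaptive downsampling named in the abstract.
    """
    return [row[0::2] for row in frame[0::2]]


def make_side_descriptions(frames):
    """Return two temporally interleaved, spatially downsampled side descriptions."""
    even, odd = split_descriptions(frames)
    return [downsample_2x(f) for f in even], [downsample_2x(f) for f in odd]


# Toy input: four 4x4 frames whose pixel values encode (t, y, x) as 100*t + 10*y + x.
frames = [[[100 * t + 10 * y + x for x in range(4)] for y in range(4)] for t in range(4)]
d0, d1 = make_side_descriptions(frames)
```

Because each side description is an ordinary lower-resolution frame sequence on a square grid, it could in principle be passed unchanged to a standard encoder, which is the standard-compliance property the abstract describes. The decoder-side regression is not sketched here.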

Low Bitrates Audio Object Coding Using Convolutional Auto-Encoder and Densenet Mixture Model.

Adaptive subband partition encoding scheme for multiple audio objects using CNN and residual dense blocks mixture network

Stacked Sparse Autoencoder for Audio Object Coding.

Sparse Autoencoder Based Multiple Audio Objects Coding Method

Audio-Visual Speech Enhancement with Deep Multi-modality Fusion

An Improved Method for Scalable Video Coding at Low Bit Rates

A High Fidelity and Low Complexity Neural Audio Coding

InSE-NET: A Perceptually Coded Audio Quality Model based on CNN

A Audio Coding Algorithm for Interactive Communications

MDCTCodec: A Lightweight MDCT-based Neural Audio Codec towards High Sampling Rate and Low Bitrate Scenarios

Distributed Audio Coding in Wireless Sensor Networks

Psychoacoustic Calibration of Loss Functions for Efficient End-to-End Neural Audio Coding

SemantiCodec: An Ultra Low Bitrate Semantic Audio Codec for General Sound

Optimizing Neural Speech Codec for Low-Bitrate Compression via Multi-Scale Encoding

Low Bit-Rate Video Coding Via Mode-Dependent Adaptive Regression for Wireless Visual Communications.

Model-based Low Bit-Rate Video Coding for Resource-Deficient Wireless Visual Communication

SpatialCodec: Neural Spatial Speech Coding

Neural Audio Coding with Deep Complex Networks

Efficient Design of MPEG Advanced Audio Encoder

Hybrid model-and-object-based real-time conversational video coding

An Intra-BRNN and GB-RVQ Based END-TO-END Neural Audio Codec