Abstract:Although wavelet-based scalable video coding becomes the state-of-the-art video compression engine for its adaptability to heterogeneous networks and clients, a large number of attempts have been made to integrate local directionality onto discrete wavelet transform to explore the intrinsic geometrical structures. Taking into consideration that the contours and textures scattered in different scales change their directional resolutions as their curvatures change, we investigate adaptive directional resolutions along scales to achieve the dual (scale and orientation) multiresolution transform. This paper proposes nonuniform directional frequency decompositions for video representation and approximation, and exploits the nonuniformity of orientation multiresolution distribution and designs nonuniform directional filter banks to make the geometrical transform more sparse and efficient. The nonuniform directional frequency decomposition under arbitrary scales is fulfilled by a non-symmetric binary tree (NSBT) topology structure with nonuniform directional filterbank design. In turn, the proposed scalable video coding framework, called DMSVC, is enriched with the dual multiresolution transform. Each temporal subband through motion compensated temporal filtering is further decomposed into multiscale subbands, and the highpass wavelet subspaces are divided into an arbitrary number of directional subspaces in alignment with the orientation distribution via phase congruency to establish NSBT. The paraunitary perfect reconstruction condition is provided through a polyphase identical form of filter bank. Comparing with the isolated wavelet basis, our transform provides a greater correlated set of localized and anisotropic basis functions. The spatio-temporal subband coefficients are coded by a 3-D ESCOT entropy coding algorithm which is adopted to match the structure of NSBT. Experimental results show that the reconstructed video frames DMSVC in the proposed DMSVC scheme have better visual quality than existing scalable video coding schemes. It could produce higher compression ratio on video sequences full of directional edges and textures.

A Generic Video Coding Framework Based on Anisotropic Diffusion and Spatio-Temporal Completion

Generic video coding with abstraction and detail completion

Spatio-Temporal Deformable Convolution for Compressed Video Quality Enhancement

Extended application of scalable video coding methods

A New Framework Based on Spatio-Temporal Information for Enhancing Compressed Video

Generalized In-Scale Motion Compensation Framework for Spatial Scalable Video Coding.

Video Coding with Spatio-Temporal Texture Synthesis

A Coding Framework and Benchmark towards Low-Bitrate Video Understanding

A software-only videocodec using pixelwise conditional differential replenishment and perceptual enhancements

Studies On Spatial Scalable Frameworks For Motion Aligned 3d Wavelet Video Coding

Efficient and universal scalable video coding

Scalable Video Compression Framework with Adaptive Orientational Multiresolution Transform and Nonuniform Directional Filterbank Design

Temporal context video compression with flow-guided feature prediction

A novel video coding framework by perceptual representation and macroblock-based matching pursuit algorithm

FVC: An End-to-End Framework Towards Deep Video Compression in Feature Space

Learning-Based Video Compression Framework With Implicit Spatial Transform for Applications in the Internet of Things

A Survey on Perceptually Optimized Video Coding

I$^2$VC: A Unified Framework for Intra- & Inter-frame Video Compression

Spatial Decomposition and Temporal Fusion based Inter Prediction for Learned Video Compression

An Efficient Coding Framework For Compact Descriptors Extracted From Video Sequence