Abstract:In multi-view video systems, the decoded texture video and its corresponding depth video are utilized to synthesize virtual views from different perspectives using the depth-image-based rendering (DIBR) technology in 3D-high efficiency video coding (3D-HEVC). However, the distortion of the compressed multi-view video and the disocclusion problem in DIBR can easily cause obvious holes and cracks in the synthesized views, degrading the visual quality of the synthesized views. To address this problem, a novel two-stream re-parameterized refocusing hybrid attention (TRRHA) network is proposed to significantly improve the quality of synthesized views. Firstly, a global multi-scale residual information stream is applied to extract the global context information by using refocusing attention module (RAM), and the RAM can detect the contextual feature and adaptively learn channel and spatial attention feature to selectively focus on different areas. Secondly, a local feature pyramid attention information stream is used to fully capture complex local texture details by using re-parameterized refocusing attention module (RRAM). The RRAM can effectively capture multi-scale texture details with different receptive fields, and adaptively adjust channel and spatial weights to adapt to information transformation at different sizes and levels. Finally, an efficient feature fusion module is proposed to effectively fuse the extracted global and local information streams. Extensive experimental results show that the proposed TRRHA achieves significantly better performance than the state-of-the-art methods. The source code will be available at https://github.com/647-bei/TRRHA .

TPARN: A Network for Enhancing Synthetic Video Quality after 3D-HEVC Encoding

TRRHA: A two-stream re-parameterized refocusing hybrid attention network for synthesized view quality enhancement

Convolutional Neural Network Based Synthesized View Quality Enhancement for 3D Video Coding

Synthesis-Aware Region-Based 3D Video Coding.

High-Efficiency 3D Depth Coding Based on Perceptual Quality of Synthesized Video.

A Robust Quality Enhancement Method Based on Joint Spatial-Temporal Priors for Video Coding

Video Encoding Enhancement Via Content-Aware Spatial and Temporal Super-Resolution

Deep Learning-based Perceptual Video Quality Enhancement for 3D Synthesized View

Global-Context Aggregated Intra Prediction Network for Depth Video Coding.

Texture-Aware Depth Prediction in 3D Video Coding.

Decomposition, Compression, and Synthesis (DCS)-based Video Coding: A Neural Exploration via Resolution-Adaptive Learning

Multiple Resolution Prediction With Deep Up-Sampling for Depth Video Coding

Adaptive view synthesis optimization for low complexity 3D-HEVC encoding.

FastCNN: Towards Fast and Accurate Spatiotemporal Network for HEVC Compressed Video Enhancement.

BSTN: an Effective Framework for Compressed Video Quality Enhancement

HPC: Hierarchical Progressive Coding Framework for Volumetric Video

Multi-Layer Features Fusion Model-Guided Low-Complexity 3D-HEVC Intra Coding

Advanced residual prediction enhancement for 3D-HEVC

Neural video compression using patio-temporal priors

Spatio-Temporal Detail Information Retrieval for Compressed Video Quality Enhancement

Deep Residual Network for Enhancing Quality of the Decoded Intra Frames of Hevc