Abstract: To provide an excellent visual experience for customers, virtual reality (VR) sources require higher resolutions and better visual quality than traditional picture sequences. Playback devices can map the content of a VR video onto a sphere to present a 360-degree scene, which is usually called VR360 in the industrial community. The most popular formats for VR360 sources are the equirectangular projection (ERP) and the cubemap projection (CMP). Both ERP and CMP pictures can be effectively projected onto a virtual three-dimensional spherical surface for rendering. This brings a new challenge to the compression of VR video sources: how to reallocate bit-rate properly to match the mainstream projection formats. The most intuitive way to deal with this challenge is to empirically assign a fixed quantization parameter (QP) to each coding unit according to its position, which evidently lacks precision and rationality and thus degrades coding performance. This research proposes a new entropy equilibrium optimization (EEO) methodology to enhance the coding performance of VR360 videos. Specifically, we develop a spherical bit-rate equalization strategy to obtain a block-level Lagrangian multiplier for the rate-distortion optimization process in video coding. The appropriate QP value for each block is then dynamically determined in accordance with its Lagrangian multiplier. Based on our EEO methodology, we develop two algorithms, EEOA-ERP and EEOA-CMP, to enhance compression efficiency for ERP and CMP pictures, respectively. Experimental results demonstrate that both algorithms achieve significant BD-Rate savings and outperform the HM16.17 platform in the all-intra (AI), low-delay (LD), and random-access (RA) configurations. Concretely, compared with the state-of-the-art algorithm WSU-ERP, the proposed EEOA-ERP achieves a BD-Rate saving of 0.37% in the LD configuration. Furthermore, the proposed EEOA-CMP gains 2.6% in objective quality in the RA configuration when compared with the HM16.17 VR CMP under the common test condition.
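
The general idea of deriving a per-block QP from a position-dependent Lagrangian multiplier can be illustrated with a minimal sketch. This is not the EEO method itself, whose spherical bit-rate equalization is not specified in the abstract. It instead assumes a simple cosine-of-latitude distortion weight for ERP rows (the weighting used by WS-PSNR) and the empirical logarithmic lambda-to-QP mapping commonly associated with HM rate control. All function names, the block size, and the clamping range are hypothetical choices made for illustration.

```python
import math


def erp_row_weight(row: float, height: int) -> float:
    """Cosine-of-latitude weight of an ERP pixel row (assumed WS-PSNR-style weight)."""
    return math.cos((row + 0.5 - height / 2) * math.pi / height)


def qp_from_lambda(lam: float) -> float:
    """Assumed empirical lambda-to-QP mapping: QP = 4.2005 * ln(lambda) + 13.7122."""
    return 4.2005 * math.log(lam) + 13.7122


def block_qp(base_lambda: float, block_row: int, block_size: int, height: int,
             qp_min: int = 0, qp_max: int = 51) -> int:
    """Derive a QP for one row of blocks from its latitude-scaled multiplier.

    Minimizing w * D + lambda * R is equivalent to minimizing
    D + (lambda / w) * R, so the block-level multiplier is base_lambda / w.
    """
    # Centre pixel row of the block row (may be fractional).
    center_row = block_row * block_size + (block_size - 1) / 2
    # Floor the weight so blocks touching the poles do not divide by ~0.
    w = max(erp_row_weight(center_row, height), 1e-3)
    block_lambda = base_lambda / w
    qp = round(qp_from_lambda(block_lambda))
    return min(max(qp, qp_min), qp_max)


if __name__ == "__main__":
    height, block_size = 1024, 64
    # Choose the base multiplier that the assumed mapping sends to QP 32.
    base_lambda = math.exp((32 - 13.7122) / 4.2005)
    for b in range(height // block_size):
        print(b, block_qp(base_lambda, b, block_size, height))
```

Under these assumptions, blocks near the equator keep roughly the base QP, while blocks near the poles, which are heavily oversampled in ERP, receive a larger multiplier and therefore a higher QP.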

Optimizing Immersive Video Coding Configurations Using Deep Learning: A Case Study on TMIV

Optimal Camera Placement for 6 Degree-of-Freedom Immersive Video Streaming Without Accessing 3D Scenes.

Video Coding Optimization for Virtual Reality 360-Degree Source

Optimized Video Coding For Omnidirectional Videos

DATRA-MIV: Decoder-Adaptive Tiling and Rate Allocation for MPEG Immersive Video

Optimal Volumetric Video Streaming with Hybrid Saliency based Tiling

Towards 6DoF live video streaming system for immersive media

Optimal Wireless Streaming of Multi-Quality 360 VR Video By Exploiting Natural, Relative Smoothness-Enabled, and Transcoding-Enabled Multicast Opportunities

On the Optimal Encoding Ladder of Tiled 360° Videos for Head-Mounted Virtual Reality

Optimizing Inter-View Prediction Structures for Multi-View Video Coding Using Simulated Annealing

On Objective and Subjective Quality of 6dof Synthesized Live Immersive Videos

Immersive Video Compression using Implicit Neural Representations

Optimizing Fixation Prediction Using Recurrent Neural Networks for 360° Video Streaming in Head-Mounted Virtual Reality

Towards Optimal Real-time Volumetric Video Streaming: A Rolling Optimization and Deep Reinforcement Learning Based Approach

Towards Low Latency Multi-viewpoint 360° Interactive Video: A Multimodal Deep Reinforcement Learning Approach

Prediction, Communication, and Computing Duration Optimization for VR Video Streaming

Toward Adaptive Volumetric Video Streaming: A Joint Network-Viewport Adaptation Framework

One-Click Upgrade from 2D to 3D: Sandwiched RGB-D Video Compression for Stereoscopic Teleconferencing

Multi-Task Decision-Making for Multi-User 360 Video Processing over Wireless Networks

Viewport Adaptation-Based Immersive Video Streaming: Perceptual Modeling and Applications.

Towards High-Quality and Efficient Video Super-Resolution via Spatial-Temporal Data Overfitting