Abstract: Augmented reality, interactive navigation in 3D scenes, multiview video, and other emerging multimedia applications require large sets of images, hence larger data volumes and increased resources compared with traditional video services. The significant increase in the number of images in multiview systems leads to challenging new problems in data representation and data transmission when a high quality of experience must be provided in resource-constrained environments. In order to reduce the size of the data, different multiview video compression strategies have been proposed recently. Most of them use the concept of reference or key views, which are used to estimate other images when there is high correlation in the data set. In such coding schemes, the following two questions become fundamental: 1) how many reference views have to be chosen to keep a good reconstruction quality under coding cost constraints? and 2) where should these key views be placed in the multiview data set? As these questions are largely overlooked in the literature, we study the reference view selection problem and propose an algorithm for the optimal selection of reference views in multiview coding systems. Based on a novel metric that measures the similarity between the views, we formulate an optimization problem for the positioning of the reference views, such that both the distortion of the view reconstruction and the coding rate cost are minimized. We solve this new problem with a shortest path algorithm that determines both the optimal number of reference views and their positions in the image set. We experimentally validate our solution in a practical multiview distributed coding system and in the standardized 3D-HEVC multiview coding scheme. We show that considering the 3D scene geometry in the reference view positioning problem brings significant rate-distortion improvements and outperforms the traditional coding strategy that simply selects key frames based on the distance between cameras.
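To make the shortest-path formulation more concrete, the following is a minimal sketch under several assumptions that the abstract does not state: the views lie along a one-dimensional camera arrangement, the first and last views are always coded as references, each non-reference view is reconstructed from the closer (less dissimilar) of its two enclosing references, and the objective is a weighted sum of distortion and rate. The names `select_reference_views`, `dissim`, `rate`, and `lam` are hypothetical, and `dissim` is only a stand-in for the similarity metric proposed in the paper. This is an illustration of the general idea, not the authors' algorithm.

```python
from typing import List, Sequence, Tuple


def select_reference_views(
    dissim: Sequence[Sequence[float]],
    rate: Sequence[float],
    lam: float = 1.0,
) -> Tuple[List[int], float]:
    """Pick reference views by a shortest path over a DAG of view indices.

    dissim[i][k]: assumed distortion when view k is reconstructed from
                  reference i (hypothetical stand-in for the paper's metric).
    rate[i]:      assumed cost of coding view i as a reference.
    lam:          assumed weight trading rate against distortion.
    Views 0 and n-1 are forced to be references (an assumption of this sketch).
    """
    n = len(rate)
    if n == 0:
        raise ValueError("at least one view is required")

    best = [float("inf")] * n   # best[j]: cheapest path ending at reference j
    prev = [-1] * n             # predecessor reference on that path
    best[0] = lam * rate[0]

    for j in range(1, n):
        for i in range(j):
            # Edge i -> j: code j as a reference and reconstruct every view
            # strictly between i and j from the better of the two references.
            segment = sum(min(dissim[i][k], dissim[j][k]) for k in range(i + 1, j))
            cost = best[i] + lam * rate[j] + segment
            if cost < best[j]:
                best[j] = cost
                prev[j] = i

    # Backtrack from the last view to recover the reference positions.
    refs = []
    v = n - 1
    while v != -1:
        refs.append(v)
        v = prev[v]
    refs.reverse()
    return refs, best[n - 1]


# Toy example: 7 equally spaced views, squared index distance as dissimilarity,
# and the same rate cost for every reference.
n_views = 7
dissim = [[float((i - k) ** 2) for k in range(n_views)] for i in range(n_views)]
rate = [2.0] * n_views

refs, cost = select_reference_views(dissim, rate, lam=1.0)
print(refs, cost)
```

In this toy setting the path both decides how many references to use and where to put them: raising `lam` makes each reference more expensive, so fewer are selected. A geometry-aware `dissim` (rather than plain camera distance) is where the scene-dependent behaviour described in the abstract would enter.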
