Abstract:Motion compensation with fractional motion vector has been widely utilized in the video coding standards. The fractional samples are usually generated by fractional interpolation filters. Traditional interpolation filters are usually designed based on the signal processing theory with the assumption of band-limited signal, which cannot effectively capture the non-stationary property of video content and cannot adapt to the variety of video quality. In this paper, we reveal an intuitive property of the fractional interpolation problem, named invertibility. That is, the fractional interpolation filters should not only generate fractional samples from integer samples but also recover the integer samples from the fractional samples in an invertible manner. We prove in theory that the invertibility in the spatial domain is equivalent to the constant magnitude in the Fourier transform domain. Driven by the invertibility, we then develop a learning-based method to solve the fractional interpolation problem. Inspired by the advances of convolutional neural network (CNN), we propose to establish an end-to-end scheme using CNN to train invertibility-driven interpolation filter (InvIF). Different from the previous learning-based methods, the proposed training scheme does not need hand-crafted "ground truth" of fractional samples. The proposed InvIF is integrated into high efficiency video coding (HEVC), and extensive experiments are conducted to verify its effectiveness. The experimental results show that the proposed method can achieve on average 4.7% and 3.6% BD-rate reduction compared with the HEVC anchor, under low-delay-B and random-access configurations, respectively.

The Invertible U-Net for Optical-Flow-free Video Interframe Generation

Multiframe Interpolation for Video Using Phase Features

Frame Interpolation Using Phase and Amplitude Feature Pyramids

A Unified Pyramid Recurrent Network for Video Frame Interpolation

Super SloMo: High Quality Estimation of Multiple Intermediate Frames for Video Interpolation

Channel Attention Is All You Need for Video Frame Interpolation

A Temporally-Aware Interpolation Network for Video Frame Inpainting

Invertibility-Driven Interpolation Filter for Video Coding.

Bidirectional Transformer for Video Deblurring

Video Diffusion Models are Strong Video Inpainter

Video Frame Interpolation via Adaptive Separable Convolution

Video Frame Synthesis using Deep Voxel Flow

Video Frame Interpolation without Temporal Priors

A Flexible Recurrent Residual Pyramid Network for Video Frame Interpolation

Convolutional Neural Network-Based Invertible Half-Pixel Interpolation Filter for Video Coding.

IDO-VFI: Identifying Dynamics via Optical Flow Guidance for Video Frame Interpolation with Events

Disentangled Motion Modeling for Video Frame Interpolation

Fine-Grained Motion Estimation for Video Frame Interpolation

Efficient Feature Extraction for High-resolution Video Frame Interpolation