Abstract: Implicit neural representations store videos as neural networks and have performed well on vision tasks such as video compression and denoising. Taking a frame index or positional index as input, implicit representations (NeRV, E-NeRV, etc.) reconstruct video from fixed, content-agnostic embeddings. Such embeddings largely limit regression capacity and internal generalization for video interpolation. In this paper, we propose a Hybrid Neural Representation for Videos (HNeRV), in which a learnable encoder generates content-adaptive embeddings that serve as the decoder input. Beyond the input embedding, we introduce HNeRV blocks, which distribute model parameters evenly across the entire network so that higher layers (layers near the output) have more capacity to store high-resolution content and video details. With content-adaptive embeddings and a redesigned architecture, HNeRV outperforms implicit methods on video regression tasks in both reconstruction quality ($+4.7$ PSNR) and convergence speed ($16\times$ faster), and shows better internal generalization. As a simple and efficient video representation, HNeRV also offers decoding advantages in speed, flexibility, and deployment compared to traditional codecs (H.264, H.265) and learning-based compression methods. Finally, we explore the effectiveness of HNeRV on downstream tasks such as video compression and video inpainting. The project page is at https://haochen-rye.github.io/HNeRV and the code is at https://github.com/haochen-rye/HNeRV.
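The abstract reports reconstruction quality as a PSNR gain. As a reminder of what that metric measures, the following is a minimal sketch of the standard definition, $\mathrm{PSNR} = 10 \log_{10}(\mathrm{MAX}^2 / \mathrm{MSE})$, in plain Python. The function name `psnr`, the flattened toy frame, and the pixel range of $[0, 1]$ are illustrative assumptions and are not taken from the HNeRV code.

```python
import math


def psnr(reference, reconstruction, max_value=1.0):
    """Peak signal-to-noise ratio in dB between two equal-length pixel sequences.

    Hypothetical helper for illustration; max_value is the largest possible
    pixel value (1.0 is assumed here for normalized frames).
    """
    if len(reference) != len(reconstruction):
        raise ValueError("inputs must have the same number of pixels")
    if not reference:
        raise ValueError("inputs must be non-empty")
    # Mean squared error over all pixels.
    mse = sum((r - x) ** 2 for r, x in zip(reference, reconstruction)) / len(reference)
    if mse == 0:
        return math.inf  # identical inputs: PSNR is unbounded
    return 10.0 * math.log10(max_value ** 2 / mse)


# Toy flattened "frame" with pixel values in [0, 1] (made-up data).
frame = [0.1, 0.3, 0.5, 0.7, 0.9]
coarse = [p + 0.1 for p in frame]   # every pixel off by 0.1
fine = [p + 0.05 for p in frame]    # every pixel off by 0.05

gain = psnr(frame, fine) - psnr(frame, coarse)
print(f"coarse: {psnr(frame, coarse):.2f} dB, fine: {psnr(frame, fine):.2f} dB, gain: {gain:.2f} dB")
```

Because PSNR is logarithmic in the mean squared error, halving the per-pixel error in this toy example raises PSNR by about 6 dB. By the same formula, a gain of 4.7 dB corresponds to reducing the MSE by a factor of roughly $10^{0.47} \approx 3$.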

DNeRV: Modeling Inherent Dynamics Via Difference Neural Representation for Videos

Towards Scalable Neural Representation for Diverse Videos

DS-NeRV: Implicit Neural Video Representation with Decomposed Static and Dynamic Codes

PNeRV: A Polynomial Neural Representation for Videos

VQ-NeRV: A Vector Quantized Neural Representation for Videos

VQNeRV: Vector Quantization Neural Representation for Video Compression

HiNeRV: Video Compression with Hierarchical Encoding-based Neural Representation

HNeRV: A Hybrid Neural Representation for Videos

NeRV: Neural Representations for Videos

NERV++: An Enhanced Implicit Neural Video Representation

Temporal Enhanced Hybrid Neural Representation for Video Compression

PNeRV: Enhancing Spatial Consistency via Pyramidal Neural Representation for Videos

Neural Video Representation for Redundancy Reduction and Consistency Preservation

Implicit Neural Representation for Videos Based on Residual Connection

How Deep Neural Networks Understand Motion? Toward Interpretable Motion Modeling by Leveraging the Relative Change in Position

E-NeRV: Expedite Neural Video Representation with Disentangled Spatial-Temporal Context

MNeRV: A Multilayer Neural Representation for Videos

Fast Encoding and Decoding for Implicit Video Representation

Immersive Video Compression using Implicit Neural Representations

FFNeRV: Flow-Guided Frame-Wise Neural Representations for Videos

PS-NeRV: Patch-wise Stylized Neural Representations for Videos