NGP-RT: Fusing Multi-Level Hash Features with Lightweight Attention for Real-Time Novel View Synthesis

Yubin Hu,Xiaoyang Guo,Yang Xiao,Jingwei Huang,Yong-Jin Liu

2024-07-15

Abstract:This paper presents NGP-RT, a novel approach for enhancing the rendering speed of Instant-NGP to achieve real-time novel view synthesis. As a classic NeRF-based method, Instant-NGP stores implicit features in multi-level grids or hash tables and applies a shallow MLP to convert the implicit features into explicit colors and densities. Although it achieves fast training speed, there is still a lot of room for improvement in its rendering speed due to the per-point MLP executions for implicit multi-level feature aggregation, especially for real-time applications. To address this challenge, our proposed NGP-RT explicitly stores colors and densities as hash features, and leverages a lightweight attention mechanism to disambiguate the hash collisions instead of using computationally intensive MLP. At the rendering stage, NGP-RT incorporates a pre-computed occupancy distance grid into the ray marching strategy to inform the distance to the nearest occupied voxel, thereby reducing the number of marching points and global memory access. Experimental results show that on the challenging Mip-NeRF360 dataset, NGP-RT achieves better rendering quality than previous NeRF-based methods, achieving 108 fps at 1080p resolution on a single Nvidia RTX 3090 GPU. Our approach is promising for NeRF-based real-time applications that require efficient and high-quality rendering.

Computer Vision and Pattern Recognition

What problem does this paper attempt to address?

The problem that this paper attempts to solve is to achieve real - time novel view synthesis based on Neural Radiance Field (NeRF) while maintaining high - quality rendering. Specifically, although existing NeRF methods perform well in terms of training speed and rendering quality, they still have difficulty meeting the requirements of real - time rendering in large - scale scenes. In particular, the Instant - NGP method achieves high - quality novel view synthesis through multi - level implicit feature grids and shallow MLPs, but the shallow MLPs for per - point feature aggregation still result in a relatively slow rendering speed. To solve this problem, the paper proposes the NGP - RT method, which mainly improves the rendering speed in the following two aspects: 1. **Lightweight Attention Mechanism**: NGP - RT explicitly stores colors and densities as hash features and utilizes a lightweight attention mechanism to eliminate hash collisions instead of using computationally intensive MLPs. This method can efficiently aggregate multi - level explicit hash features while maintaining high expressiveness. 2. **Occupancy - Distance Grid**: During the rendering stage, NGP - RT introduces a pre - computed occupancy - distance grid and incorporates it into the ray - marching strategy. The occupancy - distance grid can inform the distance to the nearest occupied voxel, thereby reducing the number of marching points and the number of global memory accesses, further accelerating the rendering process. Through these improvements, NGP - RT achieves better rendering quality and a higher frame rate (reaching 108 fps at 1080p resolution) than existing methods on the challenging Mip - NeRF 360 dataset. This makes NGP - RT have great potential in real - time applications that require efficient and high - quality rendering.

NGP-RT: Fusing Multi-Level Hash Features with Lightweight Attention for Real-Time Novel View Synthesis

Multiscale Representation for Real-Time Anti-Aliasing Neural Rendering

MixRT: Mixed Neural Representations For Real-Time NeRF Rendering

MF-NeRF: Memory Efficient NeRF with Mixed-Feature Hash Table

Adaptive Multi-NeRF: Exploit Efficient Parallelism in Adaptive Multiple Scale Neural Radiance Field Rendering

RT-Octree: Accelerate PlenOctree Rendering with Batched Regular Tracking and Neural Denoising for Real-time Neural Radiance Fields.

CD-NGP: A Fast Scalable Continual Representation for Dynamic Scenes

Cascaded and Generalizable Neural Radiance Fields for Fast View Synthesis

How Far Can We Compress Instant-NGP-Based NeRF?

Neural Radiance Fields with Hash-Low-Rank Decomposition

ACFNeRF: Accelerating and Cache-Free Neural Rendering Via Point Cloud-Based Distance Fields.

Learning Neural Duplex Radiance Fields for Real-Time View Synthesis

FastNeRF: High-Fidelity Neural Rendering at 200FPS

Hyb-NeRF: A Multiresolution Hybrid Encoding for Neural Radiance Fields

A General Implicit Framework for Fast NeRF Composition and Rendering

RT-NeRF: Real-Time On-Device Neural Radiance Fields Towards Immersive AR/VR Rendering

IntrinsicNGP: Intrinsic Coordinate based Hash Encoding for Human NeRF

KiloNeRF: Speeding Up Neural Radiance Fields with Thousands of Tiny MLPs

SRender: Boosting Neural Radiance Field Efficiency Via Sensitivity-Aware Dynamic Precision Rendering

Efficient View Synthesis with Neural Radiance Distribution Field