Boost Your NeRF: A Model-Agnostic Mixture of Experts Framework for High Quality and Efficient Rendering

Francesco Di Sario,Riccardo Renzulli,Enzo Tartaglione,Marco Grangetto
2024-10-07
Abstract:Since the introduction of NeRFs, considerable attention has been focused on improving their training and inference times, leading to the development of Fast-NeRFs models. Despite demonstrating impressive rendering speed and quality, the rapid convergence of such models poses challenges for further improving reconstruction quality. Common strategies to improve rendering quality involves augmenting model parameters or increasing the number of sampled points. However, these computationally intensive approaches encounter limitations in achieving significant quality enhancements. This study introduces a model-agnostic framework inspired by Sparsely-Gated Mixture of Experts to enhance rendering quality without escalating computational complexity. Our approach enables specialization in rendering different scene components by employing a mixture of experts with varying resolutions. We present a novel gate formulation designed to maximize expert capabilities and propose a resolution-based routing technique to effectively induce sparsity and decompose scenes. Our work significantly improves reconstruction quality while maintaining competitive performance.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The problem that this paper attempts to solve is: to improve the rendering quality of Neural Radiance Fields (NeRFs) models without significantly increasing the computational complexity. Specifically, although the Fast - NeRFs model performs well in terms of rendering speed and quality, its rapid convergence poses a challenge to further improving the reconstruction quality. Common improvement strategies, such as increasing the number of model parameters or sampling points, can improve the quality to a certain extent, but will lead to a significant increase in computational cost. To solve these problems, the author proposes a method based on the Sparse Mixture of Experts (MoE) framework, which can significantly improve the rendering quality without increasing the computational complexity. This framework achieves this goal by allowing expert models of different resolutions to focus on different parts of the rendering scene. The following are the main contributions of this method: 1. **Proposing a general framework**: Based on sparse MoE models of different resolutions, it can significantly improve the rendering quality while maintaining competitive training and inference times. 2. **A new gating mechanism**: Inspired by the Fast - NeRF model, a new gating formula is designed to maximize the ability of the mixture of experts. 3. **Resolution - based routing technology**: A new routing technology is introduced, which gives priority to assigning tokens to low - resolution models and minimizes the use of high - resolution models, thereby decomposing the scene on a frequency basis. 4. **Extensive experimental verification**: Through experiments with different NeRF architectures and datasets, higher rendering quality and efficiency are demonstrated. This method can not only effectively improve the rendering quality, but also remain efficient in terms of computational resources, solving the trade - off problem between quality improvement and computational cost in existing methods.