Evangelos Skartados,Mehmet Kerim Yucel,Bruno Manganelli,Anastasios Drosou,Albert Saà-Garriga
Abstract:Neural Radiance Fields (NeRF) have quickly become the primary approach for 3D reconstruction and novel view synthesis in recent years due to their remarkable performance. Despite the huge interest in NeRF methods, a practical use case of NeRFs has largely been ignored; the exploration of the scene space modelled by a NeRF. In this paper, for the first time in the literature, we propose and formally define the scene exploration framework as the efficient discovery of NeRF model inputs (i.e. coordinates and viewing angles), using which one can render novel views that adhere to user-selected criteria. To remedy the lack of approaches addressing scene exploration, we first propose two baseline methods called Guided-Random Search (GRS) and Pose Interpolation-based Search (PIBS). We then cast scene exploration as an optimization problem, and propose the criteria-agnostic Evolution-Guided Pose Search (EGPS) for efficient exploration. We test all three approaches with various criteria (e.g. saliency maximization, image quality maximization, photo-composition quality improvement) and show that our EGPS performs more favourably than other baselines. We finally highlight key points and limitations, and outline directions for future research in scene exploration.
What problem does this paper attempt to address?
The problem that this paper attempts to solve is the efficient exploration of specific viewpoints in the NeRF (Neural Radiance Fields) scene space. Specifically, the paper proposes a scene exploration framework, aiming to find camera positions and viewing angles that can generate new viewpoints meeting user - specified criteria through a given NeRF model. These criteria can include containing a certain object, improving photo composition, maximizing object saliency, etc.
### Paper Background
The NeRF method has received wide attention due to its excellent performance in 3D reconstruction and new - view synthesis. However, despite the great technological progress of the NeRF method, how to efficiently explore the scene space modeled by NeRF has not been fully studied. The paper points out that existing NeRF applications mainly focus on reconstruction and synthesis, while ignoring how to efficiently explore new viewpoints from existing NeRF models.
### Main Contributions
1. **Formally propose a scene exploration framework**: This is a framework independent of the NeRF method, aiming to find camera positions that can generate new viewpoints meeting user - specified criteria.
2. **Propose baseline methods**: Due to the lack of suitable testing methods, the paper proposes two baseline methods - Guided Random Search (GRS) and Pose Interpolation - Based Search (PIBS).
3. **Propose an Evolution - Guided Pose Search method (EGPS)**: This is a method based on genetic algorithms, which can efficiently and accurately solve the scene exploration problem.
4. **Experimental verification**: The paper conducts extensive experiments through multiple tasks (such as photo composition improvement, saliency maximization, and image quality maximization), and the results show that the EGPS method is superior to the baseline methods.
### Experimental Setup
- **NeRF method**: Use Instant - NGP as the underlying NeRF method because it has a fast convergence speed and good real - time rendering performance.
- **Dataset**: Use multiple real - scene datasets, including LLFF, Instruct - NeRF2NeRF, Instant - NGP, Tanks and Temples, etc.
- **Evaluation metric**: Propose two metrics - Criteria Value Improvement Ratio (CVIR) and mean Criteria Value Improvement Ratio (mCVIR) to quantify the improvement degree of the search method in generating new viewpoints.
### Experimental Results
- **Photo composition improvement**: Evaluate the improvement of photo composition through SAMP - Net, and the results show that the EGPS method can effectively improve the composition quality of photos.
- **Saliency maximization**: Evaluate the improvement of new viewpoints in saliency through tasks such as detecting Waldo, and the EGPS method also performs well.
- **Image quality maximization**: Evaluate the image quality of new viewpoints through a no - reference image quality evaluation network, and the EGPS method also performs well in this task.
### Conclusion
The paper successfully proposes and implements an efficient NeRF scene exploration framework, and demonstrates its superiority in multiple tasks through the EGPS method. Future research directions may include further optimizing the search algorithm, expanding application scenarios, and improving exploration efficiency.