Abstract:Neural surface reconstruction has been shown to be powerful for recovering dense 3D surfaces via image-based neural rendering. However, current methods struggle to recover detailed structures of real-world scenes. To address the issue, we present Neuralangelo, which combines the representation power of multi-resolution 3D hash grids with neural surface rendering. Two key ingredients enable our approach: (1) numerical gradients for computing higher-order derivatives as a smoothing operation and (2) coarse-to-fine optimization on the hash grids controlling different levels of details. Even without auxiliary inputs such as depth, Neuralangelo can effectively recover dense 3D surface structures from multi-view images with fidelity significantly surpassing previous methods, enabling detailed large-scale scene reconstruction from RGB video captures.

What problem does this paper attempt to address?

The problem that this paper attempts to solve is to improve the accuracy and detail level of 3D surface reconstruction from multi - view images. Specifically, current methods have difficulties in restoring the detailed structures of real - world scenes. To address this challenge, the paper proposes the Neuralangelo framework, which combines the representational power of multi - resolution 3D hash grids and neural surface rendering techniques to achieve high - fidelity 3D surface reconstruction and can work effectively even without auxiliary data such as depth or segmentation. ### Main Contributions 1. **Framework Innovation**: The Neuralangelo framework naturally incorporates the representational power of multi - resolution hash encoding into the neural SDF (Signed Distance Function) representation. 2. **Technical Improvements**: Two simple but effective techniques are proposed to improve the quality of hash - encoding - based surface reconstruction: - **Higher - Order Derivatives and Numerical Gradients**: Numerical gradients are used to calculate higher - order derivatives (such as surface normals for eikonal regularization), which are crucial for stabilizing the optimization process. - **Coarse - to - Fine Optimization Strategy**: Different levels of detail are gradually optimized to effectively restore different levels of structures. 3. **Experimental Verification**: Through extensive experiments on standard benchmarks and real - world scenes, significant improvements in reconstruction accuracy and view synthesis quality of Neuralangelo are demonstrated. ### Technical Details - **Neural Volume Rendering**: NeRF (Neural Radiance Field) maps 3D spatial positions to colors and volume densities through an MLP and uses neural volume rendering to combine these predicted values into pixel colors. - **SDF Representation**: SDF implicitly represents a surface \( S=\{x\in\mathbb{R}^3\mid f(x) = 0\} \) through its zero - level set, where \( f(x) \) is the SDF value. - **Multi - Resolution Hash Encoding**: Using a multi - resolution grid, the corner points of each grid cell are mapped to a hash entry to store the encoded features. The input position \( x_i \) is mapped to a position \( x_{i,l}=x_i\cdot V_l \) on each grid resolution \( V_l \) and the feature vector \( \gamma_l(x_{i,l}) \) is obtained through trilinear interpolation. ### Key Technologies - **Numerical Gradient Calculation**: Analytic gradients are discontinuous in position, causing optimization updates to only propagate to the local hash grid and lack non - local smoothness. By using numerical gradients, it is possible to go beyond local hash grid cells and achieve smoother optimization. - **Gradual Optimization**: By gradually activating hash grids of different resolutions, the problem of "re - learning" of fine - grained grids during the coarse optimization process is avoided, thus better capturing details. ### Experimental Results - **DTU Benchmark**: Neuralangelo achieves the best performance in both Chamfer distance and PSNR metrics, even without using auxiliary inputs. - **Tanks and Temples Dataset**: In large - scale indoor/outdoor scenes, Neuralangelo performs best in PSNR and F1 - score and can restore high - fidelity surfaces with complex details. ### Conclusion Neuralangelo significantly improves the quality and detail level of 3D surface reconstruction from multi - view images through simple technological innovations without relying on auxiliary data such as depth or segmentation. This makes Neuralangelo widely applicable and advantageous in practical applications.

Neuralangelo: High-Fidelity Neural Surface Reconstruction

HR-NeuS: Recovering High-Frequency Surface Geometry via Neural Implicit Surfaces

High-Fidelity Mask-free Neural Surface Reconstruction for Virtual Reality

Res-NeuS: Deep Residuals and Neural Implicit Surface Learning for Multi-View Reconstruction

Neural Surfel Reconstruction: Addressing Loop Closure Challenges in Large-Scale 3D Neural Scene Mapping

AniSDF: Fused-Granularity Neural Surfaces with Anisotropic Encoding for High-Fidelity 3D Reconstruction

Neural 3D Reconstruction in the Wild

Neural Kernel Surface Reconstruction

HF-NeuS: Improved Surface Reconstruction Using High-Frequency Details

MorpheuS: Neural Dynamic 360° Surface Reconstruction from Monocular RGB-D Video

Anti-Aliased Neural Implicit Surfaces with Encoding Level of Detail

Recovering Fine Details for Neural Implicit Surface Reconstruction

NeuDA: Neural Deformable Anchor for High-Fidelity Implicit Surface Reconstruction

Dynamic Multi-View Scene Reconstruction Using Neural Implicit Surface

UNISURF: Unifying Neural Implicit Surfaces and Radiance Fields for Multi-View Reconstruction

NeRS: Neural Reflectance Surfaces for Sparse-view 3D Reconstruction in the Wild

Delicate Textured Mesh Recovery from NeRF via Adaptive Surface Refinement

NeuS: Learning Neural Implicit Surfaces by Volume Rendering for Multi-view Reconstruction

NeuralRecon: Real-Time Coherent 3D Scene Reconstruction from Monocular Video

MonoNeuralFusion: Online Monocular Neural 3D Reconstruction with Geometric Priors

Neural Surface Reconstruction and Rendering for LiDAR-Visual Systems