Abstract:The quality of three-dimensional reconstruction is a key factor affecting the effectiveness of its application in areas such as virtual reality (VR) and augmented reality (AR) technologies. Neural Radiance Fields (NeRF) can generate realistic images from any viewpoint. It simultaneously reconstructs the shape, lighting, and materials of objects, and without surface defects, which breaks down the barrier between virtuality and reality. The potential spatial correspondences displayed by NeRF between reconstructed scenes and real-world scenes offer a wide range of practical applications possibilities. Despite significant progress in 3D reconstruction since NeRF were introduced, there remains considerable room for exploration and experimentation. NeRF-based models are susceptible to interference issues caused by colored "fog" noise. Additionally, they frequently encounter instabilities and failures while attempting to reconstruct unbounded scenes. Moreover, the model takes a significant amount of time to converge, making it even more challenging to use in such scenarios. Our approach, coined Enhance-NeRF, which adopts joint color to balance low and high reflectivity objects display, utilizes a decoding architecture with prior knowledge to improve recognition, and employs multi-layer performance evaluation mechanisms to enhance learning capacity. It achieves reconstruction of outdoor scenes within one hour under single-card condition. Based on experimental results, Enhance-NeRF partially enhances fitness capability and provides some support to outdoor scene reconstruction. The Enhance-NeRF method can be used as a plug-and-play component, making it easy to integrate with other NeRF-based models. The code is available at: https://github.com/TANQIanQ/Enhance-NeRF

DSEM-NeRF: Multimodal feature fusion and global-local attention for enhanced 3D scene reconstruction

Attention-based Multi-modal Fusion Network for Semantic Scene Completion.

${M^2D}$NeRF: Multi-Modal Decomposition NeRF with 3D Feature Fields

DSSMNeRF: Depth Self-supervised MVS NeRF

Semantic Reconstruction based on RGB Image and Sparse Depth

Sem2NeRF: Converting Single-View Semantic Masks to Neural Radiance Fields

NeRF-Det++: Incorporating Semantic Cues and Perspective-aware Depth Supervision for Indoor Multi-View 3D Detection

GP-NeRF: Generalized Perception NeRF for Context-Aware 3D Scene Understanding

S$^3$-NeRF: Neural Reflectance Field from Shading and Shadow under a Single Viewpoint

3D Former: Monocular Scene Reconstruction with 3D SDF Transformers

Semantic Is Enough: Only Semantic Information For NeRF Reconstruction

DistillNeRF: Perceiving 3D Scenes from Single-Glance Images by Distilling Neural Fields and Foundation Model Features

MM-NeRF: Large-Scale Scene Representation with Multi-Resolution Hash Grid and Multi-View Priors Features

IMENet: Joint 3D Semantic Scene Completion and 2D Semantic Segmentation through Iterative Mutual Enhancement

Multi-Modal Attention-based Fusion Model for Semantic Segmentation of RGB-Depth Images

Drone-NeRF: Efficient NeRF based 3D scene reconstruction for large-scale drone survey

Enhance-NeRF: Multiple Performance Evaluation for Neural Radiance Fields

NeRF-SOS: Any-View Self-supervised Object Segmentation on Complex Scenes

FeatureNeRF: Learning Generalizable NeRFs by Distilling Foundation Models

DM-NeRF: 3D Scene Geometry Decomposition and Manipulation from 2D Images

DaRF: Boosting Radiance Fields from Sparse Inputs with Monocular Depth Adaptation