Abstract:Recent advances in 3D deep learning have shown that it is possible to train highly effective deep models for 3D shape generation, directly from 2D images. This is particularly interesting since the availability of 3D models is still limited compared to the massive amount of accessible 2D images, which is invaluable for training. The representation of 3D surfaces itself is a key factor for the quality and resolution of the 3D output. While explicit representations, such as point clouds and voxels, can span a wide range of shape variations, their resolutions are often limited. Mesh-based representations are more efficient but are limited by their ability to handle varying topologies. Implicit surfaces, however, can robustly handle complex shapes, topologies, and also provide flexible resolution control. We address the fundamental problem of learning implicit surfaces for shape inference without the need of 3D supervision. Despite their advantages, it remains nontrivial to (1) formulate a differentiable connection between implicit surfaces and their 2D renderings, which is needed for image-based supervision; and (2) ensure precise geometric properties and control, such as local smoothness. In particular, sampling implicit surfaces densely is also known to be a computationally demanding and very slow operation. To this end, we propose a novel ray-based field probing technique for efficient image-to-field supervision, as well as a general geometric regularizer for implicit surfaces, which provides natural shape priors in unconstrained regions. We demonstrate the effectiveness of our framework on the task of single-view image-based 3D shape digitization and show how we outperform state-of-the-art techniques both quantitatively and qualitatively.

Inferring 3D Occupancy Fields Through Implicit Reasoning on Silhouette Images

In-Hand 3D Object Reconstruction from a Monocular RGB Video

Unsupervised Occupancy Learning from Sparse Point Cloud

Learning to Infer Implicit Surfaces without 3D Supervision

Learning Occupancy Function from Point Clouds for Surface Reconstruction

Efficient Implicit Neural Reconstruction Using LiDAR

ReN Human: Learning Relightable Neural Implicit Surfaces for Animatable Human Rendering

RenderOcc: Vision-Centric 3D Occupancy Prediction with 2D Rendering Supervision

Learning Neural Implicit through Volume Rendering with Attentive Depth Fusion Priors

Evaluation of Five Enzyme Immunoassays Compared with the Cytotoxicity Assay for Diagnosis of Clostridium Difficile-Associated Diarrhea in Dogs

Learning Visibility Field for Detailed 3D Human Reconstruction and Relighting

Explicit Occlusion Reasoning for Multi-person 3D Human Pose Estimation

Holistic 3D Scene Understanding from a Single Image with Implicit Representation.

OccFlowNet: Towards Self-supervised Occupancy Estimation via Differentiable Rendering and Occupancy Flow

Learning a Room with the Occ-SDF Hybrid: Signed Distance Function Mingled with Occupancy Aids Scene Representation

Learning Occupancy for Monocular 3D Object Detection

RadOcc: Learning Cross-Modality Occupancy Knowledge through Rendering Assisted Distillation

3D Keypoint Estimation Using Implicit Representation Learning

ImpDet: Exploring Implicit Fields for 3D Object Detection

Neural Implicit 3D Shapes from Single Images with Spatial Patterns.