LAM3D: Large Image-Point-Cloud Alignment Model for 3D Reconstruction from Single Image

Ruikai Cui,Xibin Song,Weixuan Sun,Senbo Wang,Weizhe Liu,Shenzhou Chen,Taizhang Shang,Yang Li,Nick Barnes,Hongdong Li,Pan Ji

2024-05-24

Abstract:Large Reconstruction Models have made significant strides in the realm of automated 3D content generation from single or multiple input images. Despite their success, these models often produce 3D meshes with geometric inaccuracies, stemming from the inherent challenges of deducing 3D shapes solely from image data. In this work, we introduce a novel framework, the Large Image and Point Cloud Alignment Model (LAM3D), which utilizes 3D point cloud data to enhance the fidelity of generated 3D meshes. Our methodology begins with the development of a point-cloud-based network that effectively generates precise and meaningful latent tri-planes, laying the groundwork for accurate 3D mesh reconstruction. Building upon this, our Image-Point-Cloud Feature Alignment technique processes a single input image, aligning to the latent tri-planes to imbue image features with robust 3D information. This process not only enriches the image features but also facilitates the production of high-fidelity 3D meshes without the need for multi-view input, significantly reducing geometric distortions. Our approach achieves state-of-the-art high-fidelity 3D mesh reconstruction from a single image in just 6 seconds, and experiments on various datasets demonstrate its effectiveness.

Computer Vision and Pattern Recognition

What problem does this paper attempt to address?

### Problems the Paper Attempts to Solve The paper aims to address the issue of geometric distortion encountered when generating high-quality 3D mesh models from a single image. Although existing large reconstruction models (LRMs) have made significant progress in the automated generation of 3D content, these models often produce geometrically inaccurate 3D meshes due to the inherent challenges of inferring 3D shapes from image data. Specifically, the paper proposes a new framework called **Large Image Point Cloud Alignment Model (LAM3D)**, which utilizes 3D point cloud data to enhance the fidelity of the generated 3D meshes. LAM3D achieves this goal through the following steps: 1. **Point Cloud Network Development**: First, a point cloud-based network is developed to generate precise and meaningful latent tri-planes, laying the foundation for accurate 3D mesh reconstruction. 2. **Image-Point Cloud Feature Alignment**: Then, through single input image processing techniques, the image features are aligned with the latent tri-planes, enriching the image features with robust 3D information. This process not only enhances the image features but also promotes the generation of high-fidelity 3D meshes without the need for multi-view inputs, significantly reducing geometric distortion. Through these methods, LAM3D is capable of generating high-fidelity 3D meshes from a single image in just 6 seconds, and experiments on multiple datasets have validated its effectiveness.

LAM3D: Large Image-Point-Cloud Alignment Model for 3D Reconstruction from Single Image

Precise 3d reconstruction from a single image

3D Reconstruction and Semantic Segmentation Method Combining PointNet and 3D-Lmnet from Single Image

Outdoor Scene 3D Reconstruction from Multiple Point Cloud

Single-Image 3-D Reconstruction: Rethinking Point Cloud Deformation

Multi-View Large Reconstruction Model via Geometry-Aware Positional Encoding and Attention

DP-MVS: Detail Preserving Multi-View Surface Reconstruction of Large-Scale Scenes

Online 3D Reconstruction Based On Lidar Point Cloud

Leveraging photogrammetric mesh models for aerial-ground feature point matching toward integrated 3D reconstruction

One-2-3-45: Any Single Image to 3D Mesh in 45 Seconds without Per-Shape Optimization

Part123: Part-aware 3D Reconstruction from a Single-view Image

2L3: Lifting Imperfect Generated 2D Images into Accurate 3D

RECONSTRUCTION OF 3D MODELS FROM POINT CLOUDS WITH HYBRID REPRESENTATION

MeshLRM: Large Reconstruction Model for High-Quality Mesh

LRM: Large Reconstruction Model for Single Image to 3D

Enhanced Multi-Scale Attention-Driven 3D Human Reconstruction from Single Image

3D3M: 3D Modulated Morphable Model for Monocular Face Reconstruction

A Point Matching Strategy of 3D Loss Function for Single RGB Images Deep Mesh Reconstruction

Accurate Reconstruction of the LoD3 Building Model by Integrating Multi-Source Point Clouds and Oblique Remote Sensing Imagery

A Novel Fusion Method of 3D Point Cloud and 2D Images for 3D Environment Reconstruction

Automated Reconstruction Of Complex Object By Integrating Point Clouds And Digital Images