3D Scene Diffusion Guidance using Scene Graphs

Mohammad Naanaa, Katharina Schmid, Yinyu Nie

2023-08-08

Abstract: Guided synthesis of high-quality 3D scenes is a challenging task. Diffusion models have shown promise in generating diverse data, including 3D scenes. However, current methods rely directly on text embeddings for controlling the generation, limiting the incorporation of complex spatial relationships between objects. We propose a novel approach for 3D scene diffusion guidance using scene graphs. To leverage the relative spatial information the scene graphs provide, we make use of relational graph convolutional blocks within our denoising network. We show that our approach significantly improves the alignment between scene description and generated scene.

Computer Vision and Pattern Recognition

What problem does this paper attempt to address?

The paper addresses how to align a generated high-quality 3D scene more accurately with its input description. Existing methods rely mainly on text embeddings to control generation, which handle complex spatial relationships between objects poorly. The authors therefore propose a method based on scene graphs: the denoising network uses relational graph convolutional blocks to exploit the relative spatial information that the scene graph provides, which significantly improves the alignment between the generated scene and the given conditions.

The main contributions of the paper are:

- A new 3D scene diffusion guidance method that uses a scene graph as the condition.
- A novel technique for conditioning matrix-shaped data on the scene graph, using a relational graph convolutional network.
- Experimental verification that the method significantly improves the alignment between the generated scene and the given conditions.

Through quantitative and qualitative evaluation, the paper shows that the proposed method performs better at generating 3D scenes that conform to complex input descriptions.
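To make the idea of a relational graph convolutional block more concrete, here is a minimal sketch of one standard relational graph convolution step applied to a matrix with one feature row per scene object. It is not the authors' implementation: the function name `rgcn_layer`, the relation labels `left_of` and `on_top_of`, the toy weights, and the choice to pass messages from the source of an edge to its target are all assumptions made for illustration.

```python
from typing import Dict, List, Tuple

Matrix = List[List[float]]
Edge = Tuple[int, str, int]  # (source object, relation label, target object)


def matvec(w: Matrix, x: List[float]) -> List[float]:
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w_ij * x_j for w_ij, x_j in zip(row, x)) for row in w]


def rgcn_layer(
    feats: Matrix,
    edges: List[Edge],
    w_self: Matrix,
    w_rel: Dict[str, Matrix],
) -> Matrix:
    """One relational graph convolution over per-object feature rows.

    out_i = ReLU(W_self h_i + sum_r (1 / |N_r(i)|) sum_{j in N_r(i)} W_r h_j)
    where N_r(i) holds the sources j of edges (j, r, i) pointing at object i.
    """
    # Self-connection term: every object keeps a transformed copy of itself.
    out = [matvec(w_self, h) for h in feats]

    # Group incoming neighbours by (target, relation) so each relation
    # can be normalised by its own neighbour count.
    incoming: Dict[Tuple[int, str], List[int]] = {}
    for src, rel, dst in edges:
        incoming.setdefault((dst, rel), []).append(src)

    # Add the relation-specific messages to each target object.
    for (dst, rel), sources in incoming.items():
        scale = 1.0 / len(sources)
        for src in sources:
            msg = matvec(w_rel[rel], feats[src])
            out[dst] = [o + scale * m for o, m in zip(out[dst], msg)]

    # ReLU non-linearity.
    return [[max(0.0, v) for v in row] for row in out]


# Toy scene (hypothetical): object 0 = table, 1 = chair, 2 = lamp,
# with two features per object standing in for a noisy scene matrix.
noisy_scene = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
scene_graph = [(1, "left_of", 0), (2, "on_top_of", 0)]
identity = [[1.0, 0.0], [0.0, 1.0]]
weights = {
    "left_of": [[2.0, 0.0], [0.0, 2.0]],
    "on_top_of": [[0.0, 1.0], [1.0, 0.0]],
}
updated = rgcn_layer(noisy_scene, scene_graph, identity, weights)
```

In this toy run only the table row changes, because it is the only object with incoming edges. Each relation type has its own weight matrix, which is what lets a layer of this kind treat different spatial relations differently, something a single pooled text embedding does not expose directly.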

DisCoScene: Spatially Disentangled Generative Radiance Fields for Controllable 3D-Aware Scene Synthesis

DiffuScene: Denoising Diffusion Models for Generative Indoor Scene Synthesis

Diffusion-based Generation, Optimization, and Planning in 3D Scenes

Denoising Diffusion via Image-Based Rendering

Joint Generative Modeling of Scene Graphs and Images via Diffusion Models

Novel 3D-Aware Composition Images Synthesis for Object Display with Diffusion Model

SceneGenie: Scene Graph Guided Diffusion Models for Image Synthesis

GraphDreamer: Compositional 3D Scene Synthesis from Scene Graphs

Mixed Diffusion for 3D Indoor Scene Synthesis

DORSal: Diffusion for Object-centric Representations of Scenes et al

Move Anything with Layered Scene Diffusion

3DDesigner: Towards Photorealistic 3D Object Generation and Editing with Text-guided Diffusion Models

External Knowledge Enhanced 3D Scene Generation from Sketch

Generating Images with 3D Annotations Using Diffusion Models

R3CD: Scene Graph to Image Generation with Relation-Aware Compositional Contrastive Control Diffusion

Generative Novel View Synthesis with 3D-Aware Diffusion Models

RenderDiffusion: Image Diffusion for 3D Reconstruction, Inpainting and Generation

EchoScene: Indoor Scene Generation via Information Echo over Scene Graph Diffusion

Learn to Optimize Denoising Scores for 3D Generation: A Unified and Improved Diffusion Prior on NeRF and 3D Gaussian Splatting