Differentiable Inverse Rendering with Interpretable Basis BRDFs

Hoon-Gyu Chung,Seokjun Choi,Seung-Hwan Baek
2024-12-02
Abstract:Inverse rendering seeks to reconstruct both geometry and spatially varying BRDFs (SVBRDFs) from captured images. To address the inherent ill-posedness of inverse rendering, basis BRDF representations are commonly used, modeling SVBRDFs as spatially varying blends of a set of basis BRDFs. However, existing methods often yield basis BRDFs that lack intuitive separation and have limited scalability to scenes of varying complexity. In this paper, we introduce a differentiable inverse rendering method that produces interpretable basis BRDFs. Our approach models a scene using 2D Gaussians, where the reflectance of each Gaussian is defined by a weighted blend of basis BRDFs. We efficiently render an image from the 2D Gaussians and basis BRDFs using differentiable rasterization and impose a rendering loss with the input images. During this analysis-by-synthesis optimization process of differentiable inverse rendering, we dynamically adjust the number of basis BRDFs to fit the target scene while encouraging sparsity in the basis weights. This ensures that the reflectance of each Gaussian is represented by only a few basis BRDFs. This approach enables the reconstruction of accurate geometry and interpretable basis BRDFs that are spatially separated. Consequently, the resulting scene representation, comprising basis BRDFs and 2D Gaussians, supports physically-based novel-view relighting and intuitive scene editing.
Computer Vision and Pattern Recognition,Graphics
What problem does this paper attempt to address?
The problem that this paper attempts to solve is the reconstruction problem of the spatially - varying bidirectional reflectance distribution function (SVBRDFs) in **inverse rendering** with geometric and spatial variations. Specifically, the author focuses on how to reconstruct accurate geometric shapes and interpretable basis BRDFs (Basis BRDFs) from captured multi - view flash photography images. Traditional methods have limitations when dealing with scenes of different complexities, such as the lack of intuitive separation of basis BRDFs, excessive optimization time, and low accuracy. ### Specific challenges of the problem 1. **Non - intuitiveness and entanglement of basis BRDFs**: - The basis BRDFs generated by existing methods are often uninterpretable. Each scene point is represented by multiple basis BRDFs with high weights and is difficult to understand. 2. **Fixed number of basis BRDFs**: - Existing methods usually use a fixed number of basis BRDFs and cannot adapt to scenes of different complexities. 3. **Limited illumination and viewing angles**: - When the available illumination and viewing angles are limited, accurate inverse rendering becomes particularly difficult. ### Solutions proposed in the paper To address the above challenges, the paper proposes a **differentiable inverse - rendering method**, and its main contributions include: 1. **Joint estimation of 2D Gaussian distribution and basis BRDF**: - Use 2D Gaussian distribution to represent geometric shapes and model the reflectance of each Gaussian distribution by weighted mixing of basis BRDFs. 2. **Dynamically adjust the number of basis BRDFs**: - During the analysis - synthesis optimization process, dynamically adjust the number of basis BRDFs, encourage the sparsity of basis BRDF weights, and ensure that the reflectance of each Gaussian distribution is represented by only a few basis BRDFs. 3. **Enhance training stability**: - Introduce weighted photometric loss, focusing on potential specular regions to improve training stability. 4. **Basis BRDF control methods**: - Include merging similar basis BRDFs and removing basis BRDFs that contribute less to the scene to obtain interpretable basis BRDFs. 5. **Sparsity regularization**: - Impose sparsity constraints on basis BRDF weights so that each Gaussian distribution is represented by only a few basis BRDFs, thereby achieving a spatially - separated SVBRDF representation. Through these improvements, this method can not only reconstruct accurate geometric shapes but also generate interpretable basis BRDFs, which are suitable for downstream tasks such as physically - reasonable new - view relighting and intuitive scene editing.