Blending-NeRF: Text-Driven Localized Editing in Neural Radiance Fields

Hyeonseop Song,Seokhun Choi,Hoseok Do,Chul Lee,Taehyeong Kim
2023-09-11
Abstract:Text-driven localized editing of 3D objects is particularly difficult as locally mixing the original 3D object with the intended new object and style effects without distorting the object's form is not a straightforward process. To address this issue, we propose a novel NeRF-based model, Blending-NeRF, which consists of two NeRF networks: pretrained NeRF and editable NeRF. Additionally, we introduce new blending operations that allow Blending-NeRF to properly edit target regions which are localized by text. By using a pretrained vision-language aligned model, CLIP, we guide Blending-NeRF to add new objects with varying colors and densities, modify textures, and remove parts of the original object. Our extensive experiments demonstrate that Blending-NeRF produces naturally and locally edited 3D objects from various text prompts. Our project page is available at <a class="link-external link-https" href="https://seokhunchoi.github.io/Blending-NeRF/" rel="external noopener nofollow">this https URL</a>
Computer Vision and Pattern Recognition,Artificial Intelligence,Graphics
What problem does this paper attempt to address?
The paper aims to address the challenge of local editing of 3D objects, particularly the problem of locally blending the original 3D object with new objects and style effects without distorting the object's form. To tackle this challenge, the authors propose a new model based on NeRF (Neural Radiance Fields) called Blending-NeRF. This model consists of two NeRF networks: a pre-trained NeRF and an editable NeRF, and introduces a new blending operation that allows precise editing of the target area through text input. Utilizing the pre-trained visual language alignment model CLIP, Blending-NeRF can add new objects with different colors and densities, modify textures, and remove parts of the original object based on different text prompts. Experimental results show that Blending-NeRF can naturally perform local edits from various text prompts, demonstrating superior performance.