Abstract:This paper presents a novel method for exerting fine-grained lighting control during text-driven diffusion-based image generation. While existing diffusion models already have the ability to generate images under any lighting condition, without additional guidance these models tend to correlate image content and lighting. Moreover, text prompts lack the necessary expressional power to describe detailed lighting setups. To provide the content creator with fine-grained control over the lighting during image generation, we augment the text-prompt with detailed lighting information in the form of radiance hints, i.e., visualizations of the scene geometry with a homogeneous canonical material under the target lighting. However, the scene geometry needed to produce the radiance hints is unknown. Our key observation is that we only need to guide the diffusion process, hence exact radiance hints are not necessary; we only need to point the diffusion model in the right direction. Based on this observation, we introduce a three stage method for controlling the lighting during image generation. In the first stage, we leverage a standard pretrained diffusion model to generate a provisional image under uncontrolled lighting. Next, in the second stage, we resynthesize and refine the foreground object in the generated image by passing the target lighting to a refined diffusion model, named DiLightNet, using radiance hints computed on a coarse shape of the foreground object inferred from the provisional image. To retain the texture details, we multiply the radiance hints with a neural encoding of the provisional synthesized image before passing it to DiLightNet. Finally, in the third stage, we resynthesize the background to be consistent with the lighting on the foreground object. We demonstrate and validate our lighting controlled diffusion model on a variety of text prompts and lighting conditions.

Retinex-Diffusion: On Controlling Illumination Conditions in Diffusion Models via Retinex Theory

Replica exchange light transport on relaxed distributions.

Toward a General Model for Reflection Recovery and Single Image Enhancement.

Diff-Retinex: Rethinking Low-light Image Enhancement with A Generative Diffusion Model

LightIt: Illumination Modeling and Control for Diffusion Models

DiLightNet: Fine-grained Lighting Control for Diffusion-based Image Generation

Controllable Light Diffusion for Portraits

Reti-Diff: Illumination Degradation Image Restoration with Retinex-based Latent Diffusion Model

DifFRelight: Diffusion-Based Facial Performance Relighting

ReCo-Diff: Explore Retinex-Based Condition Strategy in Diffusion Model for Low-Light Image Enhancement

A Diffusion Approach to Radiance Field Relighting using Multi‐Illumination Synthesis

A Diffusion Approach to Radiance Field Relighting using Multi-Illumination Synthesis

LightenDiffusion: Unsupervised Low-Light Image Enhancement with Latent-Retinex Diffusion Models

Differential Diffusion: Giving Each Pixel Its Strength

A New Radiosity Approach by Procedural Refinements for Realistic Image Sythesis

A Novel Retinex Based Approach for Image Enhancement with Illumination Adjustment

Diffusion Model-Based Image Editing: A Survey

RGB$\leftrightarrow$X: Image decomposition and synthesis using material- and lighting-aware diffusion models

Neural Gaffer: Relighting Any Object via Diffusion

Efficient Diffusion as Low Light Enhancer

LiteDiT: An Efficient Diffusion Transformer Model for Remote Sensing Image Synthesis Focus on Object