Abstract:Blind inverse problems, where both the target data and forward operator are unknown, are crucial to many computer vision applications. Existing methods often depend on restrictive assumptions such as additional training, operator linearity, or narrow image distributions, thus limiting their generalizability. In this work, we present LADiBI, a training-free framework that uses large-scale text-to-image diffusion models to solve blind inverse problems with minimal assumptions. By leveraging natural language prompts, LADiBI jointly models priors for both the target image and operator, allowing for flexible adaptation across a variety of tasks. Additionally, we propose a novel posterior sampling approach that combines effective operator initialization with iterative refinement, enabling LADiBI to operate without predefined operator forms. Our experiments show that LADiBI is capable of solving a broad range of image restoration tasks, including both linear and nonlinear problems, on diverse target image distributions.

What problem does this paper attempt to address?

The problem that this paper attempts to solve is **Blind Inverse Problems**, especially the image restoration task in computer vision applications when both the target data and the forward operator are unknown. Specifically, the authors propose a new method named LADiBI (Language - Assisted Diffusion for Blind Inverse problems), aiming to use large - scale pre - trained text - to - image diffusion models to solve blind inverse problems without additional training or data collection. ### Main Challenges 1. **Limitations of Existing Methods**: Most existing methods rely on restrictive assumptions, such as requiring additional training, operator linearity assumptions, or narrow image distributions, which limit their generalization ability. 2. **The Nature of Blind Inverse Problems**: Since both the target data and the forward operator are unknown, such problems are ill - posed in nature, that is, there is no unique solution, so prior knowledge or other assumptions need to be introduced to solve them. ### Solutions in the Paper 1. **Training - Free Framework**: LADiBI is a training - free framework that uses large - scale pre - trained text - to - image diffusion models to solve blind inverse problems with minimal assumptions. 2. **Joint Modeling of Priors**: Through natural language prompts, LADiBI simultaneously models priors for images and operators, thus flexibly adapting to various tasks. 3. **Novel Posterior Sampling Method**: A new posterior sampling method is proposed, combined with effective operator initialization and iterative optimization, enabling LADiBI to operate without pre - defining the operator form. 4. **Wide Applicability**: Experiments show that LADiBI can solve a wide range of image restoration tasks, including linear and nonlinear problems, and is suitable for diverse target image distributions. ### Mathematical Expression Blind inverse problems can be formalized as: \[ y = A_\phi(x)+n \] where: - \( y \) is the observed measurement value, - \( A_\phi \) is the forward operator with parameter \( \phi \), - \( x \) is the unknown target data, - \( n \) is the measurement noise. The goal of LADiBI is to sample from the posterior distribution \( p(x, A_\phi|y)\propto p(y|x, A_\phi)p(x, A_\phi) \) to recover the unknown data \( x \) and the operator \( A_\phi \). ### Summary The main contribution of this paper is to provide a general and flexible framework that uses large - scale pre - trained text - to - image diffusion models to solve blind inverse problems, significantly reducing the dependence on specific tasks and the need for additional training. This method not only improves the generalization ability of the algorithm but also expands its flexibility and practicality in practical applications.

Blind Inverse Problem Solving Made Easy by Text-to-Image Latent Diffusion

Blind Inversion using Latent Diffusion Priors

Blind Deconvolution Using an Improved L0 Sparse Representation

Solving Linear Inverse Problems Provably via Posterior Sampling with Latent Diffusion Models

Prompt-tuning latent diffusion models for inverse problems

Blind Image Restoration via Fast Diffusion Inversion

G2D2: Gradient-guided Discrete Diffusion for image inverse problem solving

GibbsDDRM: A Partially Collapsed Gibbs Sampler for Solving Blind Inverse Problems with Denoising Diffusion Restoration

Particle-Filtering-based Latent Diffusion for Inverse Problems

Training Adaptive Reconstruction Networks for Blind Inverse Problems

A Variational Perspective on Solving Inverse Problems with Diffusion Models

Warped Diffusion: Solving Video Inverse Problems with Image Diffusion Models

Inverse Problems with Diffusion Models: A MAP Estimation Perspective

EDICT: Exact Diffusion Inversion via Coupled Transformations

Ambiguity in solving imaging inverse problems with deep learning based operators

Kernel Diffusion: An Alternate Approach to Blind Deconvolution

Lightning-Fast Image Inversion and Editing for Text-to-Image Diffusion Models

Taming Generative Diffusion Prior for Universal Blind Image Restoration

Solving Video Inverse Problems Using Image Diffusion Models

Accelerating Diffusion Models for Inverse Problems through Shortcut Sampling