Abstract:In this paper, we address the unified image restoration challenge by reframing it as a contrastive learning-based classification problem. Despite the significant strides made by deep learning methods in enhancing image restoration quality, their limited capacity to generalize across diverse degradation types and intensities necessitates the training of separate models for each specific degradation scenario. We proposes an all-encompassing approach that can restore images from various unknown corruption types and levels. We devise a method that learns representations of the latent sharp image’s degradation and accompanying textual features (such as dataset categories and image content descriptions), converting these into prompts which are then embedded within a reconstruction network model to enhance cross-database restoration performance. This culminates in a unified image reconstruction framework. The study involves two stages: In the first stage, we design a MultiContentNet that learns multi-modal features (MMFs) of the latent sharp image. This network encodes the visual degradation expressions and contextual text features into latent variables, thereby exerting a guided classification effect. Specifically, MultiContentNet is trained as an auxiliary controller capable of taking the degraded input image and, through contrastive learning, extracts MMFs of the latent target image. This effectively generates natural classifiers tailored for different degradation types. The second phase integrates the learned MMFs into an image restoration network via cross-attention mechanisms. This guides the restoration model to learn high-fidelity image recovery. Experiments conducted on six blind image restoration tasks demonstrate that the proposed method achieves state-of-the-art performance, highlighting the potential significance of large-scale pretrained vision-language models’ MMFs in advancing high-quality unified image reconstruction.

UTDM: a universal transformer-based diffusion model for multi-weather-degraded images restoration

Vision Transformers for Single Image Dehazing

Multi-Weather Degradation-Aware Transformer for Image Restoration

MWFormer: Multi-Weather Image Restoration Using Degradation-Aware Transformers

Joint Conditional Diffusion Model for Image Restoration with Mixed Degradations

ReviveDiff: A Universal Diffusion Model for Restoring Images in Adverse Weather Conditions

Joint multi-dimensional dynamic attention and transformer for general image restoration

Universal Image Restoration with Text Prompt Diffusion

Learning A Coarse-to-Fine Diffusion Transformer for Image Restoration

Diff-Restorer: Unleashing Visual Prompts for Diffusion-based Universal Image Restoration

DR-DiT: Image Deraining Using Diffusion Model with Transformer

Multi-modal Degradation Feature Learning for Unified Image Restoration Based on Contrastive Learning

An Efficient Dehazing Algorithm Based on the Fusion of Transformer and Convolutional Neural Network.

Always Clear Days: Degradation Type and Severity Aware All-In-One Adverse Weather Removal

A Unified Conditional Framework for Diffusion-based Image Restoration

WaveDM: Wavelet-Based Diffusion Models for Image Restoration

Sparse Sampling Transformer with Uncertainty-Driven Ranking for Unified Removal of Raindrops and Rain Streaks

Data-free Distillation with Degradation-prompt Diffusion for Multi-weather Image Restoration

Restoring Vision in Adverse Weather Conditions With Patch-Based Denoising Diffusion Models

Multi-patch de-raindrop Transformer for UAV images