Abstract: Infrared and visible data fusion (IVF) aims to generate a fused output that simultaneously highlights salient thermal radiation features and preserves texture information. In an intelligent transportation system (ITS), such an output both captures the information needed for traffic movement and highlights otherwise invisible objects that must be avoided. IVF can therefore improve environmental perception in challenging traffic situations, e.g., foggy scenarios, rainy environments, and low-light illumination. However, currently available IVF algorithms offer no theoretically grounded way to integrate a priori knowledge and the network structure into a unified model. Moreover, they fail to handle infrared and visible data pairs with different resolutions, which are common in real ITS scenarios. To this end, this study develops a novel model-inspired unsupervised network termed IVF-Net. Specifically, an enhanced IVF model (IVFM), which pays more attention to detailed texture information and salient objects, is first established. Following proximal gradient theory, we then map this model into a deep network with learnable feature extraction parameters, aiming to draw on the strengths of both the fusion model and deep learning to better describe the IVF task. Finally, a multiple task-driven loss function is designed to train the mapped network. Unlike previous work, IVF-Net is motivated by IVFM, so each of its layers has semantic interpretability and a clear role, which leads to a significantly enhanced fusion effect. Another advantage is that it is composed only of simple convolution-based structures, which keeps it lightweight and efficient. Experiments demonstrate that IVF-Net has a stronger ability to capture key traffic information and highlight the salient features of imperceptible objects, which makes it an excellent candidate for improving the reliability of subsequent applications in ITS.
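
The idea of mapping a model into a network via proximal gradient theory can be illustrated with a minimal sketch. The abstract does not state the IVFM objective, so everything below is an assumption made for illustration: a toy objective with two quadratic data-fidelity terms and an L1 penalty, hypothetical function names (`soft_threshold`, `unrolled_fusion`), and fixed values for the step size and threshold. In an unrolled network, each loop pass would correspond to one layer, and such fixed quantities would typically be replaced by learnable parameters.

```python
def soft_threshold(v, tau):
    """Proximal operator of tau * |.| applied to a scalar."""
    if v > tau:
        return v - tau
    if v < -tau:
        return v + tau
    return 0.0


def unrolled_fusion(ir, vis, lam=0.2, step=0.25, num_stages=30):
    """Run num_stages proximal gradient steps on a toy fusion objective.

    Toy objective (an assumption, NOT the paper's IVFM):
        0.5*||x - ir||^2 + 0.5*||x - vis||^2 + lam*||x||_1
    """
    x = [0.0] * len(ir)
    for _ in range(num_stages):  # one pass plays the role of one network layer
        # Gradient of the smooth data-fidelity terms.
        grad = [2.0 * xi - a - b for xi, a, b in zip(x, ir, vis)]
        # Gradient step followed by the proximal (soft-threshold) step.
        x = [soft_threshold(xi - step * gi, step * lam)
             for xi, gi in zip(x, grad)]
    return x


# Hypothetical 1-D "infrared" and "visible" signals of equal length.
fused = unrolled_fusion([1.0, 0.0, -0.2], [0.5, 0.1, 0.2])
```

For this toy objective the iterates approach the closed-form minimizer, the soft-thresholded average of the two inputs with threshold `lam / 2`, which gives a simple check that the stages behave as intended.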

IV-tuning: Parameter-Efficient Transfer Learning for Infrared-Visible Tasks

IVGF: The Fusion-Guided Infrared and Visible General Framework

IVJDN: an End-to-End Network for Joint Infrared and Visible Image Fusion and Detection

Infrared and Visible Image Fusion Via Test-Time Training

UniRGB-IR: A Unified Framework for Visible-Infrared Downstream Tasks via Adapter Tuning

VIF-Net: an Unsupervised Framework for Infrared and Visible Image Fusion

UniRGB-IR: A Unified Framework for RGB-Infrared Semantic Tasks Via Adapter Tuning

Visual Fourier Prompt Tuning

Low‐light Image Enhancement for Infrared and Visible Image Fusion

Rethinking Remote Sensing Pretrained Model: Instance-Aware Visual Prompting for Remote Sensing Scene Classification

A Retinex Decomposition Model-Based Deep Framework for Infrared and Visible Image Fusion

FD2-Net: Frequency-Driven Feature Decomposition Network for Infrared-Visible Object Detection

Facing the Elephant in the Room: Visual Prompt Tuning or Full Finetuning?

Segmentation-Driven Infrared and Visible Image Fusion Via Transformer-Enhanced Architecture Searching

Implicit Multi-Spectral Transformer: An Lightweight and Effective Visible to Infrared Image Translation Model

Integrating Parallel Attention Mechanisms and Multi-Scale Features for Infrared and Visible Image Fusion

Visible-Infrared Image Fusion Based on Early Visual Information Processing Mechanisms

IAIFNet: An Illumination-Aware Infrared and Visible Image Fusion Network

IVF-Net: an Infrared and Visible Data Fusion Deep Network for Traffic Object Enhancement in Intelligent Transportation Systems

On the Difficulty of Unpaired Infrared-to-Visible Video Translation: Fine-Grained Content-Rich Patches Transfer

Sparse-Tuning: Adapting Vision Transformers with Efficient Fine-tuning and Inference