PID: Physics-Informed Diffusion Model for Infrared Image Generation

Fangyuan Mao,Jilin Mei,Shun Lu,Fuyang Liu,Liang Chen,Fangzhou Zhao,Yu Hu
2024-07-12
Abstract:Infrared imaging technology has gained significant attention for its reliable sensing ability in low visibility conditions, prompting many studies to convert the abundant RGB images to infrared images. However, most existing image translation methods treat infrared images as a stylistic variation, neglecting the underlying physical laws, which limits their practical application. To address these issues, we propose a Physics-Informed Diffusion (PID) model for translating RGB images to infrared images that adhere to physical laws. Our method leverages the iterative optimization of the diffusion model and incorporates strong physical constraints based on prior knowledge of infrared laws during training. This approach enhances the similarity between translated infrared images and the real infrared domain without increasing extra training parameters. Experimental results demonstrate that PID significantly outperforms existing state-of-the-art methods. Our code is available at <a class="link-external link-https" href="https://github.com/fangyuanmao/PID" rel="external noopener nofollow">this https URL</a>.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
### Problems the paper attempts to solve The paper "PID: Physical - Information Diffusion Model for Infrared Image Generation" aims to solve the problem that existing image - translation methods ignore physical laws when converting RGB images into infrared images. Specifically: 1. **Limitations of existing methods**: - **Style change**: Most existing image - translation methods regard infrared images as a style change and ignore the physical laws behind them, which limits their practical applications. - **Generation quality**: Existing methods such as variational auto - encoders (VAEs) and generative adversarial networks (GANs) can perform image translation, but the generated images often lack clarity or have problems of training instability and mode collapse. 2. **Importance of physical constraints**: - **Physical accuracy**: The generated infrared images need to conform to the actual physical characteristics, such as temperature distribution and radiation intensity. Existing generation methods have deviations in these aspects, which may lead to misidentification in downstream tasks. - **Insufficient data sets**: Obtaining infrared images requires special equipment, resulting in far fewer publicly available infrared data sets than RGB image data sets. Therefore, generating high - quality infrared images from RGB images is of great significance for enhancing infrared data sets. 3. **Proposed method**: - **Physical - information diffusion model (PID)**: The paper proposes a physical - information diffusion model (PID). By introducing physical constraints during the training process, the generated infrared images are made to conform more to the actual physical laws. This model utilizes the iterative optimization ability of the diffusion model and combines prior physical knowledge to improve the quality and authenticity of the generated images. ### Main contributions - **Adopting the diffusion model**: By analyzing the intrinsic characteristics of infrared image translation, it is found that adopting latent diffusion models (LDMs) can generate higher - quality infrared images. - **Physical constraints**: By introducing physical constraints, it is ensured that the generated images conform to the basic physical laws, thus performing better in downstream tasks. - **Efficient decomposition method**: An efficient TeV decomposition method suitable for general superimposed - spectrum infrared images is proposed, which simplifies the data collection process and broadens its application range. - **Performance improvement**: State - of - the - art results are achieved on multiple metrics. In particular, the FID scores on the FLIR and KAIST data sets are reduced by 45.14 and 55.75 respectively. ### Summary This paper solves the problem that existing methods ignore physical laws when generating infrared images by introducing the physical - information diffusion model (PID), significantly improving the quality and authenticity of the generated images. This method has important application prospects in the field of infrared image generation.