Imaging through the Atmosphere using Turbulence Mitigation Transformer

Xingguang Zhang, Zhiyuan Mao, Nicholas Chimitt, Stanley H. Chan
2023-12-11
Abstract: Restoring images distorted by atmospheric turbulence is a ubiquitous problem in long-range imaging applications. While existing deep-learning-based methods have demonstrated promising results in specific testing conditions, they suffer from three limitations: (1) lack of generalization capability from synthetic training data to real turbulence data; (2) failure to scale, hence causing memory and speed challenges when extending the idea to a large number of frames; (3) lack of a fast and accurate simulator to generate data for training neural networks. In this paper, we introduce the turbulence mitigation transformer (TMT) that explicitly addresses these issues. TMT brings three contributions: Firstly, TMT explicitly uses turbulence physics by decoupling the turbulence degradation and introducing a multi-scale loss for removing distortion, thus improving effectiveness. Secondly, TMT presents a new attention module along the temporal axis to extract extra features efficiently, thus improving memory and speed. Thirdly, TMT introduces a new simulator based on the Fourier sampler, temporal correlation, and flexible kernel size, thus improving our capability to synthesize better training data. TMT outperforms state-of-the-art video restoration models, especially in generalizing from synthetic to real turbulence data. Code, videos, and datasets are available at https://xg416.github.io/TMT.
Image and Video Processing, Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
This paper addresses the problem of restoring images distorted by atmospheric turbulence. It argues that, although existing deep-learning-based methods perform well under specific test conditions, they have three main limitations:

1. **Insufficient generalization from synthetic training data to real turbulence data**: Existing methods are trained mainly on synthetic data and perform poorly on real-world turbulence.
2. **Poor scalability, leading to memory and speed problems**: Existing methods run into memory and speed limits when processing a large number of frames.
3. **Lack of a fast and accurate simulator for generating training data**: Existing turbulence simulators are either too slow or too inaccurate to produce sufficient training data.

To address these limitations, the paper introduces a multi-frame image restoration transformer, the Turbulence Mitigation Transformer (TMT). Its main contributions are:

1. **Explicit use of the turbulence physical model**:
   - The restoration task is decomposed into two steps, de-tilting and de-blurring, instead of using a single-step network as general video restoration transformers do.
   - A multi-scale loss function supervises training at different resolutions, which improves generalization.
2. **The Temporal-Channel Joint Attention (TCJA) module**:
   - TCJA allows TMT to process more frames with less memory, which widens the temporal range of usable frames and improves generalization and speed.
3. **A new turbulence simulator**:
   - Built on a Fourier sampler, temporal correlation, and flexible kernel size, the new simulator generates higher-quality training data.

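
The idea of a multi-scale loss mentioned above can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the choice of L1 distance, the 2x average-pooling downsampler, and the default of three scales are all assumptions made for illustration. In a real network the coarse predictions would typically come from intermediate outputs rather than from downsampling the final prediction.

```python
from typing import List

Image = List[List[float]]  # grayscale image stored as rows of pixel values


def downsample_2x(img: Image) -> Image:
    """Halve the resolution by averaging each non-overlapping 2x2 block."""
    # Ignore a trailing odd row or column so every block is complete.
    h, w = len(img) // 2 * 2, len(img[0]) // 2 * 2
    return [
        [
            (img[y][x] + img[y][x + 1] + img[y + 1][x] + img[y + 1][x + 1]) / 4.0
            for x in range(0, w, 2)
        ]
        for y in range(0, h, 2)
    ]


def l1_distance(a: Image, b: Image) -> float:
    """Mean absolute difference between two images of equal size."""
    total = sum(abs(pa - pb) for ra, rb in zip(a, b) for pa, pb in zip(ra, rb))
    return total / (len(a) * len(a[0]))


def multi_scale_loss(pred: Image, target: Image, num_scales: int = 3) -> float:
    """Sum the L1 distance at full resolution and at each coarser scale.

    Assumes both image sides are at least 2 ** (num_scales - 1) pixels.
    """
    loss = 0.0
    for scale in range(num_scales):
        if scale > 0:
            # Move both images to the next coarser resolution.
            pred, target = downsample_2x(pred), downsample_2x(target)
        loss += l1_distance(pred, target)
    return loss
```

With this sketch, a 4x4 checkerboard prediction compared against a uniform 0.5 target is penalized only at full resolution, because 2x2 averaging removes the pattern. A uniform offset, by contrast, is penalized at every scale, which is why coarse-scale terms emphasize large-scale errors.
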
With these improvements, TMT outperforms existing video restoration models, particularly in generalizing from synthetic to real turbulence data, on both dynamic and static scenes. The code, videos, and datasets are available at https://xg416.github.io/TMT.
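
The memory claim for temporal-channel attention can be made concrete with a rough count of attention-score entries. The sketch below rests on an assumption not spelled out in this summary: that attention is computed over the joint temporal-channel dimension, so the score matrix grows with frames times channels rather than with the number of pixels. The function names and the example sizes are hypothetical, and the comparison ignores heads, windowing, and batching, so it is an illustration rather than a measurement.

```python
def attention_map_entries(num_tokens: int) -> int:
    """Self-attention over n tokens stores an n x n score matrix."""
    return num_tokens * num_tokens


def spatiotemporal_attention_entries(frames: int, height: int, width: int) -> int:
    """Tokens are pixels: every pixel of every frame attends to all others."""
    return attention_map_entries(frames * height * width)


def temporal_channel_attention_entries(frames: int, channels: int) -> int:
    """Tokens are (frame, channel) pairs; the size is independent of resolution."""
    return attention_map_entries(frames * channels)


if __name__ == "__main__":
    # Hypothetical sizes: 12 frames, 64x64 patches, 16 feature channels.
    print(spatiotemporal_attention_entries(12, 64, 64))
    print(temporal_channel_attention_entries(12, 16))
```

Under these assumptions, doubling the number of frames quadruples both counts, but the temporal-channel count starts several orders of magnitude smaller and does not grow with image resolution.
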