TAENet: transencoder-based all-in-one image enhancement with depth awareness

Fang, Wanchuan,Wang, Chuansheng,Li, Zuoyong,Grau, Antoni,Lai, Taotao
DOI: https://doi.org/10.1007/s10489-024-05569-w
IF: 5.3
2024-06-09
Applied Intelligence
Abstract:Recently, CNN-based all-in-one image enhancement methods have been proposed to solve multiple image degradation tasks. However, these CNN-based methods usually have two limitations. One limitation is that they usually design a specific encoder for each image enhancement task, lacking of a unified and simple framework. The other limitation is that they can not effectively capture global image degradation information, as use of the CNN-based encoders has a limited local receptive field. In this work, we propose a TransEncoder-based All-in-one Image Enhancement Network (TAENet), with a single encoder and a decoder, for simultaneously handling multiple image enhancement tasks. Specifically, we propose a Transformer-based Encoder (TransEncoder), which introduces instance normalization to transformer for color recovery. The TransEncoder model global degradation information by using the transformer's global self-attention mechanism. Additionally, inspired by the Mie scattering model, we propose a novel depth loss function for perceiving image depth information by minimizing the depth difference between the enhanced image and the ground-truth, thus further improving model performance. Moreover, a novel contrastive loss is introduced to strengthen task-generalization performance by enhancing the model's representation capability. Experiments show that the TAENet outperforms 24 state-of-the-art methods on image dehazing, image deraining, and low-light image enhancement.
computer science, artificial intelligence
What problem does this paper attempt to address?