Fusiform multi-scale pixel self-attention network for hyperspectral images reconstruction from a single RGB image

Zhongmin Jiang,Wanyan Zhang,Wenju Wang
DOI: https://doi.org/10.1007/s00371-023-03006-6
2023-07-24
Abstract:Current research on deep learning algorithms focuses on the reconstruction of hyperspectral images from a single RGB image. However, this does not consider the feature information between regions, so the feature capture of context is insufficient. This causes the quality of reconstructed hyperspectral images to be low. We propose correcting this with a fusiform multi-scale pixel self-attention (FMPSA) network. The proposed FMPSA consists of a fusiform multi-scale feature extraction (FMFE) module cascaded with several multi-scale adaptive residual attention blocks (MARABs). FMFE extracts multi-scale detail features by interleaving dual components to avoid degrading spectral reconstruction quality due to local and edge spatial information loss. Each MARAB consists of paired FMFE-Left and FMFE-Right components, an optimal non-local model, a pixel self-attention module, a LayerNorm layer, a multilayer perceptron with Gelu nonlinearity, and long-short dual residual connection, which can be regarded as a residual structure based on a pixel self-attention mechanism. MARAB can adaptively track regions containing feature-rich information for more accurate hyperspectral reconstruction with a hierarchical focus on the salient pixels. The proposed FMPSA was applied to the NTIRE 2020 hyperspectral dataset. Experimental results show that the proposed method outperforms current methods in terms of MRAE and RMSE.
computer science, software engineering
What problem does this paper attempt to address?