A Channel Adaptive Dual Siamese Network for Hyperspectral Object Tracking
Xiao Jiang,Xinyu Wang,Chen Sun,Zengliang Zhu,Yanfei Zhong
DOI: https://doi.org/10.1109/tgrs.2024.3378165
IF: 8.2
2024-03-30
IEEE Transactions on Geoscience and Remote Sensing
Abstract: Hyperspectral object tracking (HOT) aims to track targets using the rich spectral information in hyperspectral video (HSV). Recently, the dual Siamese network (DSN) has been proposed for HOT and has achieved advanced performance by integrating an RGB Siamese branch with a hyperspectral Siamese branch to address the small-sample challenge of the hyperspectral modality. However, two challenges still reduce the practicality of DSN: a single DSN model has difficulty processing HSVs with varying numbers of channels, and the spatial features extracted by the pretrained RGB branch play a dominant role while the hyperspectral features are not fully explored. To address these challenges, we propose a Channel AdapTive DSN, termed SiamCAT, for HOT with varied channels. Specifically, treating each frame of an HSV as a sequence of grayscale images that varies with wavelength, a channel adaptive (CA) module is introduced to encode grayscale image sequences of different lengths into a uniform length, so that SiamCAT can process HSVs with varied channels. Meanwhile, a guided learning attention (GLA) module is proposed to progressively learn the spectral features of the tracked target, highlighted by the spatial attention of the pretrained RGB branch. Note that, to force the spectral features to play a leading role, the spectral features extracted by the hyperspectral branch are used to confirm the target position, instead of traditional feature fusion. In the experiments, SiamCAT was verified on the HOT competition dataset (i.e., 16-channel, 25-channel, and 15-channel HSVs with different wavelength ranges) and the WHU-Hi-H3 dataset (25-channel HSVs), and achieved advanced performance.
imaging science & photographic technology,remote sensing,engineering, electrical & electronic,geochemistry & geophysics
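
The abstract's idea of encoding grayscale image sequences of different lengths into a uniform length can be illustrated with a small sketch. This is not the paper's CA module, which is a learned component whose architecture the abstract does not specify. As a stand-in, the sketch uses plain adaptive average pooling along the spectral axis. The names `pool_bands_to_fixed_length` and `make_frame`, the target length of 16, and the toy 2x2 frames are all assumptions made for illustration.

```python
from typing import List

# One grayscale image (rows x cols) for a single wavelength.
Band = List[List[float]]


def pool_bands_to_fixed_length(bands: List[Band], out_len: int) -> List[Band]:
    """Map a band sequence of any length to exactly out_len bands.

    Assumption: fixed adaptive average pooling over the spectral axis,
    used here only as a non-learned stand-in for a channel adaptive encoder.
    """
    n = len(bands)
    if n == 0 or out_len <= 0:
        raise ValueError("need at least one band and a positive out_len")
    rows, cols = len(bands[0]), len(bands[0][0])
    pooled: List[Band] = []
    for i in range(out_len):
        # Bin i covers input bands [start, end); bins are never empty.
        start = (i * n) // out_len
        end = -(-((i + 1) * n) // out_len)  # ceiling division
        group = bands[start:end]
        pooled.append(
            [
                [sum(b[r][c] for b in group) / len(group) for c in range(cols)]
                for r in range(rows)
            ]
        )
    return pooled


def make_frame(num_bands: int, rows: int = 2, cols: int = 2) -> List[Band]:
    """Hypothetical toy frame: band k is a constant image with value k."""
    return [[[float(k)] * cols for _ in range(rows)] for k in range(num_bands)]


# Frames with 15, 16, and 25 bands all come out with the same length.
for num_bands in (15, 16, 25):
    encoded = pool_bands_to_fixed_length(make_frame(num_bands), out_len=16)
    print(num_bands, "->", len(encoded))
```

With a uniform output length, a single downstream network can in principle accept 15-, 16-, and 25-channel frames without per-dataset input layers, which is the practical benefit the abstract attributes to the CA module.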