Hierarchical Memory-Guided Long-Term Tracking with Meta Transformer Inquiry Network.

Xingmei Wang,Guohao Nie,Boquan Li,Yilin Zhao,Minyang Kang,Bo Liu
DOI: https://doi.org/10.1016/j.knosys.2023.110504
IF: 8.139
2023-01-01
Knowledge-Based Systems
Abstract:Long-term tracking is a critical and rapidly developing field in visual object tracking research. The tracking target’s appearance can change over time due to its motion or environmental factors, resulting in various appearance patterns. Those factors, such as dense distractors, confusing backgrounds, and motion blurs, make it difficult to track objects. Existing tracking algorithms typically rely on online learning to adapt to such long-term variations. However, obtaining reliable training samples and implementing effective updating schemes can be difficult, particularly when the target object frequently disappears and reappears. Therefore, in this paper, we propose a Hierarchical Memory-guided Long-term Tracker (HMLT) with a Meta Transformer Inquiry Network (MTIN) that refines online learning. Our method introduces a hierarchical memory strategy that considers simple and trustworthy updating of long-term tracking components using multiple target memories. We also devise MTIN to identify available memory based on the long-term variation pattern of the target, so as to avoid the risk of updating from incorrect samples. In addition, we devise a memory attention network to perform robust redetection based on long-term memory. Based on the hierarchical memory strategy, we construct a complete and learnable long-term tracking framework that uses a validator learned from memory to reconcile local and global searches. Our experimental results on several benchmarks, including LaSOT, VOT-LT2018, VOT-LT2019, TLP, OTB-2015, and UAV123, demonstrate that our proposed method achieves comparable performance to the state-of-the-art long-term tracking algorithms.
What problem does this paper attempt to address?