Tracking in tracking: An efficient method to solve the tracking distortion

Jinzhen Yao,Zhixing Wang,Jianlin Zhang,Qintao Hu,Chuanming Tang,Qiliang Bao,Zhenming Peng
DOI: https://doi.org/10.1016/j.engappai.2024.108698
IF: 8
2024-06-07
Engineering Applications of Artificial Intelligence
Abstract:Current Siamese trackers have paid more attention to Transformer-based structures for their extraordinary improvements in accuracy through extensive information fusion and cross-attention enhancement. However, the further elevation of traditional Siamese trackers' performance is lagged by the low robustness of the interference from similar distractors. The representation ability and the discrimination against similar distractors are always incompatible. Even though the representation ability of recent Transformer-based trackers is broadly enhanced, it still causes a high response to similar distractors owing to the similarity matching mechanism of the Siamese structure. To tackle the above problems, we propose a Tracking-in-Tracking (an outer tracker with an inner tracker) pipeline (TiT) consisting of an antecedent tracking stage and a refining tracking stage. Instead of just capturing a single candidate matched with the template, perceiving all potential candidates can provide proper information on possible similar distractors. Based on this insight, a Transformer-based outer tracker is constructed to recognize all candidates in the antecedent tracking stage. Subsequently, in the refining tracking stage, an inner tracker is applied to further realize accurate object identification from all selected candidates with a designed bilateral feedback mechanism (BFM) and peak distilling module (PDM). Therefore, the Transformer-based outer tracker and Motion-estimated inner tracker can supervise each other to achieve robust tracking performance without further aggravating model complexity and memory burden. Extensive experiments have demonstrated that our TiT can serve as a unified framework to discriminate similar interference and perform state-of-the-art (SOTA) performance in mainstream benchmarks.
automation & control systems,computer science, artificial intelligence,engineering, electrical & electronic, multidisciplinary
What problem does this paper attempt to address?