Learning Motion-Perceive Siamese network for robust visual object tracking
Ze Kang, Tianyang Xu, Xue-Feng Zhu, Xiao-Jun Wu
DOI: https://doi.org/10.1016/j.patrec.2023.07.011
IF: 4.757
2023-07-23
Pattern Recognition Letters
Abstract: Siamese networks enable end-to-end training for visual tracking and have achieved excellent performance in recent years. However, the classical Siamese-based formulation relies on an offline-trained appearance model to perform tracking in each frame, ignoring temporal variation at the online stage. Since temporal motion has been verified to be crucial for robust and accurate tracking, we propose a novel Motion-Perceive Siamese network (SiamMP) that explicitly predicts motion patterns, providing complementary cues for the appearance-only formulation. Specifically, successive historical frames are collected, and their appearance and potential trajectory are used to predict the next state, thereby achieving motion awareness. In addition, an adaptive fusion module performs a decision-level negotiation between the tracking evidence of the appearance and motion models. To verify the effectiveness and merit of SiamMP, extensive experiments are conducted on several challenging benchmarks, including LaSOT, OTB100, GOT-10k, TC128, and DTB70. The comparison and analysis of the results demonstrate the necessity and validity of incorporating the motion-perceive design into the Siamese framework.
computer science, artificial intelligence