Transformer Based Online Continuous Multi-Target Tracking with State Regression
Xinwei Wei,Linxiu Chen,Chenyu Zhang,Yiru Lin,Linao Zhang,Xiaokai Liu,Jixiang Jiang,Wei Yi
DOI: https://doi.org/10.1109/iccais59597.2023.10382279
2023-01-01
Abstract:Multi-target Tracking (MTT) is the process of processing received measurements to maintain estimates of the current status of multiple targets, with important applications to autonomous driving, aerial reconnaissance, underwater operations, and others. In the model-based setting, Bayesian filtering can provide the theoretical optimal estimate in a single target scenario. However, in complex situations, uncertain factors such as changes in the number of targets will cause the amount of calculation to increase exponentially, resulting in a decline in tracking accuracy. To solve that problem, model-free methods based on deep-learning provide an attractive alternative, especially the state-of-the-art architecture Transformer based encoder-decoder prediction model, which outperforms the Bayesian filters in the single frame prediction tasks. However, when switching to continuous tracking, these algorithms need to be trained separately frame by frame to adapt to the new tasks. Still, there is no correlation between their predictions from different frames, which prevents them from fully utilizing all the measurements. In this paper, we propose an end-to-end Transformer based MTT method with state autoregression, which allows the model to have the capability of online continuous tracking and make total use of the entire trajectory. The results show that the proposed model is a great extension from single-frame prediction to online continuous tracking.