MAML MOT: Multiple Object Tracking based on Meta-Learning

Jiayi Chen, Chunhua Deng
2024-08-23
Abstract: With the advancement of video analysis technology, the multi-object tracking (MOT) problem in complex scenes involving pedestrians is gaining increasing importance. This challenge primarily involves two key tasks: pedestrian detection and re-identification. While significant progress has been achieved in pedestrian detection tasks in recent years, enhancing the effectiveness of re-identification tasks remains a persistent challenge. This difficulty arises from the large total number of pedestrian samples in multi-object tracking datasets and the scarcity of individual instance samples. Motivated by recent rapid advancements in meta-learning techniques, we introduce MAML MOT, a meta-learning-based training approach for multi-object tracking. This approach leverages the rapid learning capability of meta-learning to tackle the issue of sample scarcity in pedestrian re-identification tasks, aiming to improve the model's generalization performance and robustness. Experimental results demonstrate that the proposed method achieves high accuracy on mainstream datasets in the MOT Challenge. This offers new perspectives and solutions for research in the field of pedestrian multi-object tracking.
Computer Vision and Pattern Recognition, Artificial Intelligence
What problem does this paper attempt to address?
The paper addresses the weak performance of pedestrian re-identification (Re-ID) within multi-object tracking (MOT) in complex scenes. Pedestrian detection has improved substantially in recent years, but Re-ID is still limited by sample scarcity: MOT datasets contain a large total number of pedestrian samples, yet only a few samples for each individual identity. As a result, Re-ID models generalize poorly and lack robustness. To address this, the authors introduce MAML MOT, a meta-learning-based training approach. It uses the rapid learning capability of meta-learning, that is, the ability to adapt to a new task from a small number of samples, to improve the model's generalization performance and robustness. Experimental results show that MAML MOT achieves high accuracy on mainstream MOT Challenge datasets, offering a new perspective and solution for pedestrian multi-object tracking research.
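To make the idea of adapting to a new task from a few samples concrete, here is a minimal sketch of a MAML-style training loop. It is not the authors' implementation. It assumes a toy one-parameter regression task in place of a Re-ID network, and it uses the first-order approximation of MAML (the outer gradient is taken at the adapted weight without differentiating through the inner step). All names (`sample_task`, `adapt`, `fomaml_step`) and hyperparameters are hypothetical.

```python
import random


def mse_and_grad(w, xs, ys):
    """Mean squared error of the 1-D model y = w * x, and its derivative in w."""
    n = len(xs)
    loss = sum((w * x - y) ** 2 for x, y in zip(xs, ys)) / n
    grad = sum(2.0 * (w * x - y) * x for x, y in zip(xs, ys)) / n
    return loss, grad


def sample_task(rng, n_support=3, n_query=10):
    """Hypothetical few-shot task: a random slope with only a few support samples."""
    slope = rng.uniform(-2.0, 2.0)
    xs = [rng.uniform(-1.0, 1.0) for _ in range(n_support + n_query)]
    ys = [slope * x for x in xs]
    support = (xs[:n_support], ys[:n_support])
    query = (xs[n_support:], ys[n_support:])
    return support, query


def adapt(w, xs, ys, inner_lr=0.1, steps=1):
    """Inner loop: a few gradient steps on the small support set of one task."""
    for _ in range(steps):
        _, grad = mse_and_grad(w, xs, ys)
        w = w - inner_lr * grad
    return w


def fomaml_step(w, tasks, inner_lr=0.1, outer_lr=0.05):
    """Outer loop: update the shared initialization from query-set gradients.

    First-order approximation: the query gradient is evaluated at the adapted
    weight and applied directly to the shared weight.
    """
    meta_grad = 0.0
    for (xs, ys), (xq, yq) in tasks:
        w_fast = adapt(w, xs, ys, inner_lr)
        _, grad_q = mse_and_grad(w_fast, xq, yq)
        meta_grad += grad_q
    return w - outer_lr * meta_grad / len(tasks)


if __name__ == "__main__":
    rng = random.Random(0)
    w = 5.0  # deliberately poor shared initialization
    for _ in range(200):
        w = fomaml_step(w, [sample_task(rng) for _ in range(4)])
    print("meta-learned initialization:", w)
```

In this sketch each task plays the role that a scarce-sample identity group might play in Re-ID training: the support set drives the fast adaptation, and the query set measures how well the adapted model generalizes, which is the signal used to improve the shared initialization.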