Towards More Accurate Tiny Object Tracking: Benchmark and Algorithm

Jizhe Yu,Yu Liu,Hongkui Wei,Kaiping Xu
DOI: https://doi.org/10.1109/cscwd61410.2024.10580827
2024-01-01
Abstract:Despite the significant progress in tiny object tracking algorithms based on convolutional neural networks, the performance falls short of the ideal level due to dataset challenges like limited scale, limited match backdrop, low match popularity, and low-resolution(720p) images. The lack of specialized large-scale tiny object tracking datasets leads data-hungry tracking algorithms to rely on small and saturated datasets from TrackNet. This paper propels the development of tiny object tracking and contributes the first large-scale dataset, Badminton100K, for tiny ball tracking. We provide over 100K annotations covering over 200 videos from 12 professional match events. Our dataset covers high-resolution(1080p) match videos in broad and diverse contexts. Moreover, considering the fast-moving tiny balls in videos often leads to challenges such as blur, afterimage, overlap, and disappearance. As another contribution, we also propose a novel Multi-stage and Multi-scale Fusion Network (MMFNet), which can extract local high-resolution details in the shallow stages and capture global semantic information in the deep stages, addressing these challenging tasks through multiple stages multi-scale feature fusion, enabling accurate identification and localization of tiny balls. Extensive experiments show that our method achieves unprecedented state-of-the-art tracking performance on Badminton100K. The dataset and the algorithm code are available at https://github.com/Gi-gigi/MMFNet.
What problem does this paper attempt to address?