MotionTrack: rethinking the motion cue for multiple object tracking in USV videos

Zhenqi Liang,Gang Xiao,Jianqiu Hu,Jingshi Wang,Chunshan Ding
DOI: https://doi.org/10.1007/s00371-023-02983-y
2023-07-10
Abstract:Multiple object tracking (MOT) in unmanned surface vehicle (USV) videos has many application scenarios in the military and civilian fields. State-of-the-art MOT methods first extract a set of detections from the video frames, then utilize IoU distance to associate the detections of current frame and tracklets of last frame, and finally adopt linear Kalman filter to estimate the current position of tracklets. However, some problems in USV videos seriously affect the tracking performance, such as low frame rate, wobble of observation platform, nonlinear motion of objects, small objects and ambiguous appearance. In this paper, we fully explore the motion cue in USV videos and propose a simple but effective tracker, named MotionTrack. Equipping with YOLOv7 as object detector, the data association of MotionTrack is mainly composed of cascade matching with Gaussian distance module and observation-centric Kalman filter module. We validate the effectiveness with extensive experiments on the recent Jari-Maritime-Tracking-2022 dataset, achieving new state-of-the-art 46.9 MOTA, 49.2 IDF1 with 35.2 FPS running speed on a single 3090 GPU. The source code, pretrained models with deploy versions are released at https://github.com/lzq11/MotionTrack.
computer science, software engineering
What problem does this paper attempt to address?