A Top-View Multiple People Tracking System Based on Newest YOLOv5 and DeepSort Using Depth Data
Xianglong You,Dongxiao Li,Ming Zhang
DOI: https://doi.org/10.1109/icaica54878.2022.9844474
2022-01-01
Abstract:Nowadays, with more and more video surveillance systems constructed in our society, pedestrian tracking has become an important issue that has always been discussed in the computer vision domain all over the world. How to propose a real-time tracking system with great accuracy has always been the core of this problem, it’s a bit of a paradox because you want to have a precise detection result and a real-time tracking system at the same time, but we know the best method to make target detection is a deep neural network, which is time-consuming to process immense amounts of data. And to solve the ID-switches problem, we introduce the DeepSort algorithm to make the tracking process done. So in this paper, we propose a real-time tracking system using top-view depth data by integrating the newest YOLOv5 with DeepSort that can achieve nearly 40 frames per second of a high-quality video stream. And the orientation of our camera is top-view, which can help the neural network distinguish a bunch of people with occlusion easily, at the same time we choose depth data to avoid privacy leaks problem. Once we have an accurate detection result then we use the Kalman filter and Hungarian algorithm and matching cascade to handle the matching process of multiple detection and tracks. To authenticate the surveillance system we proposed, we conducted a few experiments on different datasets that meet our data requirements, we also recorded two datasets by various cameras in our laboratory and outdoor environment. In addition, the results showed the superior advantages of top-view depth data in tracking by the detection system and improved the tracking accuracy to 99.3% which is the best mAP@0.5 of alike methods. And all experiments conducted on different video streams can reach a real-time level and verify the effectiveness of this system as well.
What problem does this paper attempt to address?