Adaptive Cross-Camera Video Analytics at the Edge

Kaiyang Chen, Yifei Zhu, Zhu Han, Xudong Wang
DOI: https://doi.org/10.1109/mass56207.2022.00063
2022-01-01
Abstract: Cross-camera video analytics is a major video analytics task that associates and analyzes information across multiple cameras. However, the search cost of existing cross-camera tracking tasks grows linearly with the number of cameras, leading to substantial cost in large-scale camera systems. Although correlation among cameras can greatly reduce the search cost, our empirical analysis reveals that this correlation changes over time, leading to sub-optimal performance for schemes that rely on rigid correlation information. Furthermore, adjusting the correlations to dynamically guide the search process is extremely challenging because of the high construction cost. In this paper, we propose an adaptive cross-camera video analytics framework guided by fine-grained estimated correlation information. Specifically, we propose a mean-field game approach that estimates the dynamic correlation from only the initial correlation and the destination correlation. We first carefully craft the cost functions and constraint functions to model the dynamics of the users in the camera systems, and formulate the correlation estimation problem as a tracking-cost minimization problem. Considering the enormous number of interactions embedded in the problem, we further reformulate it by introducing the correlation as the mean-field term. Given the complexity of solving for the equilibrium, we adopt a G-prox primal-dual hybrid gradient algorithm to solve our problem efficiently. Consequently, the correlation between the initial and the destination states can also be inferred over time. Extensive experiments on a real-world dataset demonstrate that our adaptive cross-camera video analytics framework based on fine-grained correlation can reduce the overall workload by 36% in general. For queries with a large search space, the overall workload can even be reduced by a factor of 40, with a 6% precision improvement.
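The claim that camera correlation shrinks the search space can be illustrated with a minimal sketch. This is not the paper's framework or its mean-field estimator: the function name `cameras_to_search`, the `coverage` parameter, and the correlation values are all hypothetical, and the correlation is simply assumed to be given as the probability that a target leaving one camera next appears at each other camera.

```python
from typing import Dict, List


def cameras_to_search(correlation: Dict[str, float], coverage: float) -> List[str]:
    """Pick the highest-correlation cameras until their total probability
    reaches `coverage` (hypothetical pruning rule, for illustration only)."""
    # Rank candidate cameras from most to least correlated with the source.
    ranked = sorted(correlation.items(), key=lambda kv: kv[1], reverse=True)
    selected: List[str] = []
    mass = 0.0
    for camera, probability in ranked:
        if mass >= coverage:
            break  # enough probability mass is covered; skip the rest
        selected.append(camera)
        mass += probability
    return selected


# Hypothetical correlation from a source camera "A" to its neighbours.
corr_from_a = {"B": 0.5, "C": 0.25, "D": 0.125, "E": 0.125}

pruned = cameras_to_search(corr_from_a, coverage=0.75)
exhaustive = cameras_to_search(corr_from_a, coverage=1.0)
print(f"pruned search: {pruned}, exhaustive search: {len(exhaustive)} cameras")
```

With an exhaustive search every camera is queried, so cost scales with the camera count; with the pruned list only the most likely cameras are queried. If the true correlation drifts over time, as the abstract reports, a fixed `corr_from_a` would prune the wrong cameras, which is the motivation for estimating it dynamically.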