Interaction Preservation Based Multi-feature Fusion for Video Synopsis Generation.

Kaixuan Yang,Zhixiang Zhu,Chenwu Wang,Pei Wang
DOI: https://doi.org/10.1145/3573942.3574057
2022-01-01
Abstract:Faced with a large number of surveillance videos, it will cause users to spend a lot of time browsing and retrieving videos, Video synopsis technology solves this problem, which condenses the spatiotemporally redundant long video into a compact summary video and preserves the activity information of all objects. However, the spatial and temporal displacement of objects during the synopsis process destroys the interaction between objects. This paper proposes a video synopsis method that preserves the interaction between objects. First, the depth prediction of the objects in the video is performed, and then the surveillance video is subjected to inverse perspective Mapping (IPM) to calculate the distance between objects, Finally, the depth and distance are used as the factors for judging the interaction relationship of objects, and the interaction relationship between the objects in the original video and the synopsis video is kept consistent by optimizing the interaction energy cost term. Experimental results show that the proposed method effectively preserves the interaction between objects in the synopsis.
What problem does this paper attempt to address?