Bilinear Dynamics for Crowd Video Analysis

Shuang Wu,Hang Su,Hua Yang,Shibao Zheng,Yawen Fan,Qin Zhou
DOI: https://doi.org/10.1016/j.jvcir.2017.01.026
IF: 2.887
2017-01-01
Journal of Visual Communication and Image Representation
Abstract:In this paper, a novel crowd descriptor, termed as bilinear CD (Curl and Divergence) descriptor, is proposed based on the bilinear interaction of curl and divergence. Specifically, the curl and divergence activation maps are computed from the normalized average flow. A local curl patch and the corresponding divergence patch are cropped respectively from the activation maps. The outer product of the two local patches is defined as the bilinear CD vector. Through sliding a window on the activation maps, we can get hundreds to thousands local bilinear CD vectors. To encode them into a compact representation, fisher vector pooling and PCA algorithms are applied on the local descriptors. Experiments on the CUHK crowd dataset show that the proposed bilinear dynamics can improve the performance of video classification and retrieval by a noticeable margin when compared with the existing crowd features. (C) 2017 Published by Elsevier Inc.
What problem does this paper attempt to address?