Two-Stage Model for Social Relationship Understanding from Videos

Pilin Dai,Jinna Lv,Bin Wu
DOI: https://doi.org/10.1109/icme.2019.00198
2019-01-01
Abstract:Social relationship understanding from videos bears an enormous potential for social media analysis. However, most existing researches only explored spatio-temporal features from videos, ignoring rich contextual information hidden in semantic objects. In this paper, we propose a Two-Stage Model (TSM), first introducing object information for social relationship understanding from videos, to our best knowledge. In the first stage, rich and robust representation for social relationship is obtained by the extraction of both spatio-temporal features and semantic objects information. In the second stage, we utilize a propagated knowledge graph to capture the interaction between semantic objects and video scenes. Specially, an attention mechanism is employed to measure the effectiveness of each semantic object on different scenes. Extensive experiments demonstrate our TSM achieves the state-of-the-art performance on the dataset SRIV.
What problem does this paper attempt to address?