Social Relation Graph Generation on Untrimmed Video.

Yibo Hu, Chenghao Yan, Chenyu Cao, Haorui Wang, Bin Wu
DOI: https://doi.org/10.1007/978-3-031-27818-1_61
2023-01-01
Abstract: For a more intuitive understanding of videos, we demonstrate SRGG-UnVi, a social relation graph generation system for untrimmed videos. Given a video, the demonstration can combine existing knowledge to build a dynamic relation graph and a static multi-relation graph. SRGG-UnVi integrates various multimodal technologies, including Automatic Speech Recognition (ASR), Natural Language Understanding (NLU), face recognition and clustering, and multimodal video relation extraction. The system consists of three modules: (1) The video processing engine takes advantage of parallelization to efficiently provide multimodal information to the other modules. (2) The relation recognition module utilizes this multimodal information to extract the relationships between characters in each scene. (3) The graph generation module generates the social relation graphs for users.
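The abstract does not describe the data structures behind the two graphs. As a rough illustration only, the sketch below assumes that the relation recognition module emits one record per character pair per scene, and shows one way a graph generation step could turn those records into a dynamic graph (edges per scene) and a static multi-relation graph (all relation labels per pair). The `SceneRelation` schema, the function names, the sample data, and the choice to treat relations as undirected are all hypothetical and are not taken from SRGG-UnVi.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class SceneRelation:
    """One predicted relation between two characters in one scene (hypothetical schema)."""
    scene: int
    a: str
    b: str
    relation: str


def pair_key(a, b):
    # Assumption: relations are undirected, so the pair is stored in sorted order.
    return tuple(sorted((a, b)))


def build_dynamic_graph(relations):
    """Map each scene index to the edges (pair -> relation) observed in that scene."""
    graph = defaultdict(dict)
    for r in relations:
        graph[r.scene][pair_key(r.a, r.b)] = r.relation
    return dict(graph)


def build_static_graph(relations):
    """Map each character pair to every relation label seen across the whole video."""
    graph = defaultdict(set)
    for r in relations:
        graph[pair_key(r.a, r.b)].add(r.relation)
    return dict(graph)


# Made-up sample input standing in for the relation recognition output.
relations = [
    SceneRelation(0, "Alice", "Bob", "colleague"),
    SceneRelation(1, "Bob", "Alice", "friend"),
    SceneRelation(1, "Alice", "Carol", "family"),
]
dynamic_graph = build_dynamic_graph(relations)
static_graph = build_static_graph(relations)
```

In this sketch the dynamic graph keeps the scene index so that changes in a relationship over time remain visible, while the static graph collapses time and keeps a set of labels per pair, which is one simple reading of "multi-relation".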