VidVRD 2021

Wei Ji,Yicong Li,Meng Wei,Xindi Shang,Junbin Xiao,Tongwei Ren,Tat-Seng Chua
DOI: https://doi.org/10.1145/3474085.3479232
2021-01-01
Abstract:ACM Multimedia 2021 Video Relation Understanding Challenge is the third grand challenge which aims at exploring the relationship of subjects and objects appearing in videos for fine-grained and high-level video understanding. Given a video, the video relation detection model should output a serious of relation triplet subject, predicate, object and the corresponding trajectories of subject and object. The goal of this task is to promote research on developing video semantic understanding model, so as to perform complex inferences and mining of visual knowledge in videos. In this paper, we make a comprehensive and detailed introduction of this task, conclude the proposed algorithms in the last few years, and propose future direction for research in this task.
What problem does this paper attempt to address?