The Multi-Modal Video Reasoning and Analyzing Competition

Haoran Peng,He Huang,Li Xu,Tianjiao Li,Jun Liu,Hossein Rahmani,Qiuhong Ke,Zhicheng Guo,Cong Wu,Rongchang Li,Mang Ye,Jiahao Wang,Jiaxu Zhang,Yuanzhong Liu,Tao He,Fuwei Zhang,Xianbin Liu,Tao Lin
DOI: https://doi.org/10.48550/arXiv.2108.08344
2021-08-19
Abstract:In this paper, we introduce the Multi-Modal Video Reasoning and Analyzing Competition (MMVRAC) workshop in conjunction with ICCV 2021. This competition is composed of four different tracks, namely, video question answering, skeleton-based action recognition, fisheye video-based action recognition, and person re-identification, which are based on two datasets: SUTD-TrafficQA and UAV-Human. We summarize the top-performing methods submitted by the participants in this competition and show their results achieved in the competition.
Computer Vision and Pattern Recognition,Artificial Intelligence
What problem does this paper attempt to address?