Symbiotic Attention: UTS-Baidu Submission to the EPIC-Kitchens 2020 Action Recognition Challenge

Xiaohan Wang,Yu Wu,Linchao Zhu,Yi Yang,Yueting Zhuang
2020-01-01
Abstract:In this report, we describe the technical details of our solution to the EPIC-Kitchens Action Recognition Challenge 2020. The EPIC-Kitchens dataset contains various small objects, intense motion blur, and occlusions. We tackle the egocentric action recognition task by suppressing background distractors and enhancing action-relevant interaction. First, we take candidate objects information to enable concentration on the occurring interactions. Second, we leverage a symbiotic attention mechanism with objectcentric alignment to encourage the mutual interaction between the two branches and select the most action-relevant candidates for classification. Third, we incorporate multiple modality inputs, i.e., RGB frames and optical flows, to further improve the performance by a multi-modal fusion. Our model ranked the first on both the seen and unseen test set on EPIC-Kitchens Action Recognition Challenge 2020. The code for our model will be available at https://github.com/wxh1996/SAP-EPIC.
What problem does this paper attempt to address?