Masked Autoencoders for Egocentric Video Understanding @ Ego4D Challenge 2022

Jiachen Lei,Shaoping Ma,Zhongjie Ba,Sai Vemprala,Ashish Kapoor,Kui Ren
DOI: https://doi.org/10.48550/arxiv.2211.15286
2022-01-01
Abstract:In this report, we present our approach and empirical results of applying masked autoencoders in two egocentric video understanding tasks, namely, Object State Change Classification and PNR Temporal Localization, of Ego4D Challenge 2022. As team TheSSVL, we ranked 2nd place in both tasks. Our code will be made available.
What problem does this paper attempt to address?