Cross-domain video action recognition via adaptive gradual learning

Dan Liu, Zhenwei Bao, Jinpeng Mi, Yan Gan, Mao Ye, Jianwei Zhang
DOI: https://doi.org/10.1016/j.neucom.2023.126622
IF: 6
2023-08-02
Neurocomputing
Abstract: Video-based Unsupervised Domain Adaptation (UDA) methods concentrate on addressing domain shift and improving the robustness of video models. They can be naturally applied to cross-domain action recognition tasks, but the inherent complexity of videos makes this task more challenging. Though several recent attempts have achieved superior performance in the video UDA field, most of them attempt to mitigate domain shift directly from the source domain to the target domain. These methods suffer a performance drop when the domain distribution shift is large. To better realize domain adaptation, this paper proposes an effective gradual learning framework that constructs multiple auxiliary domains to achieve progressive transfer for cross-domain video action recognition. Specifically, a Dynamic CutMix Mechanism (DCM) is introduced to build the auxiliary domains that mitigate the domain gap caused by object distance and background discrepancies. Furthermore, the Gradual Transfer Strategy (GTS) utilizes these auxiliary domains to realize cross-domain action classification gradually. Extensive experiments validate the effectiveness of the proposed method, which significantly outperforms state-of-the-art methods on multiple benchmark cross-domain datasets.
computer science, artificial intelligence
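
The abstract does not spell out how the Dynamic CutMix Mechanism builds its auxiliary domains, so the following is only a minimal sketch of the general idea, assuming standard CutMix applied to video clips: a rectangular patch from a target-domain clip is pasted into a source-domain clip, with the same box used on every frame. The function names (`cutmix_box`, `mix_clips`, `auxiliary_schedule`), the clip layout (frames, height, width, channels), and the linear schedule of mixing ratios are all assumptions made for illustration and are not taken from the paper.

```python
import numpy as np


def cutmix_box(height, width, lam, rng):
    """Sample a box covering at most about (1 - lam) of the frame area (standard CutMix)."""
    cut_ratio = np.sqrt(1.0 - lam)
    cut_h, cut_w = int(height * cut_ratio), int(width * cut_ratio)
    # Random box centre; the box is clipped at the frame borders.
    cy, cx = int(rng.integers(height)), int(rng.integers(width))
    top, bottom = max(cy - cut_h // 2, 0), min(cy + cut_h // 2, height)
    left, right = max(cx - cut_w // 2, 0), min(cx + cut_w // 2, width)
    return top, bottom, left, right


def mix_clips(source_clip, target_clip, lam, rng):
    """Paste a target-domain patch into a source clip, using one box for all frames.

    Both clips are arrays of shape (frames, height, width, channels).
    Returns the mixed clip and the fraction of pixels taken from the target clip.
    """
    _, height, width, _ = source_clip.shape
    top, bottom, left, right = cutmix_box(height, width, lam, rng)
    mixed = source_clip.copy()
    mixed[:, top:bottom, left:right, :] = target_clip[:, top:bottom, left:right, :]
    target_fraction = (bottom - top) * (right - left) / (height * width)
    return mixed, target_fraction


def auxiliary_schedule(num_domains):
    """Hypothetical linear schedule of mixing ratios, from mostly source to mostly target."""
    return [1.0 - (k + 1) / (num_domains + 1) for k in range(num_domains)]


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    source = np.zeros((4, 8, 8, 3))  # stand-in for a source-domain clip
    target = np.ones((4, 8, 8, 3))   # stand-in for a target-domain clip
    for lam in auxiliary_schedule(3):
        mixed, fraction = mix_clips(source, target, lam, rng)
        print(f"lam={lam:.2f} target pixel fraction={fraction:.3f}")
```

In this sketch each value of `lam` stands for one auxiliary domain: a gradual transfer scheme in the spirit of GTS would train on the mixtures closest to the source first and move step by step toward the target. How the paper actually selects, adapts, and orders the mixtures is not stated in the abstract.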