Modaldrop: Modality-Aware Regularization for Temporal-Spectral Fusion in Human Activity Recognition

Xin Zeng,Yiqiang Chen,Benfeng Xu,Tengxiang Zhang
DOI: https://doi.org/10.1109/ICASSP49357.2023.10095880
2023-01-01
Abstract:Although most of existing works for sensor-based Human Activity Recognition rely on the temporal view, we argue that the spectral view also provides complementary prior and accordingly benchmark a standard multi-view framework with extensive experiments to demonstrate its consistent superiority over single-view opponents. We then delve into the intrinsic mechanism of the multi-view representation fusion, and propose ModalDrop as a novel modality-aware regularization method to learn and exploit representations of both views effectively. We demonstrate its advantage over existing representation fusion alternatives with comprehensive experiments and ablations. The improvements are consistent for various settings and are orthogonal with different backbones. We also discuss its potential application for other related tasks regarding representation or modality fusion. The source code is available on https://github.com/studyzx/ModalDrop.git.
What problem does this paper attempt to address?