Class-Incremental Learning with Multiscale Distillation for Weakly Supervised Temporal Action Localization.

Tianquan Chen,Bairong Li,Yusheng Tao,Yuqing Wang,Yuesheng Zhu
DOI: https://doi.org/10.1007/978-3-031-30105-6_31
2022-01-01
Abstract:Despite recent works having made great progress in weakly supervised temporal action localization (WTAL), they still suffer from catastrophic forgetting. When only new-class videos can be utilized to update these models, their performance in old classes diminishes drastically. Even while some class-incremental learning methods are presented to assist models in continuously learning new-class knowledge, most of them focus on image classification but pay little attention to WTAL. To fill this gap, we propose a novel class-incremental learning method with multiscale distillation, which mines two separate scales of old-class information in incoming videos for updating the model. Precisely, we calculate class activation sequences (CAS) with frame-level spatio-temporal information to provide fine-grained old-class labels for the updated model. Moreover, since the high activation segments contain rich action information, we select them and construct video-level logits to constrain the updated model for maintaining the old-class knowledge further. The experimental results under various incremental learning settings on the THUMOS’14 and ActivityNet 1.3 datasets reveal that our method effectively alleviates the catastrophic forgetting problem in WTAL.
What problem does this paper attempt to address?