Supplement Material of Decoupling GCN with DropGraph Module for Skeleton-Based Action Recognition

Ke Cheng,Yifan Zhang,Congqi Cao,Lei Shi,Jian Cheng,Hanqing Lu
2020-01-01
Abstract:We discuss the setting of K in spatial DropGraph and temproal DropGraph respectively. In the spatial graph, the total number of nodes are typically less than 30 (25 nodes for NTU-RGBD, 18 nodes for Skeleton-Kinetics). Hence, we discuss K = 0, 1, 2 in Table 1. Note that when K = 0, the spatial DropGraph degenerates into dropping isolated joints, which is not effective. By expanding the drop area to 1 neighbor nodes, spatial DropGraph provides efficient regularization. Expanding drop area to 2 neighbor may cause too strong regularization. In the temporal graph, the total number of nodes are typically more than 100 (300 frames for NTU-RGBD, 150 frames for Skeleton-Kinetics). Hence, we discuss K = 0, 5, 10, 20, 30 in Table 2. When K = 0, the temporal DropGraph degenerated to drop isolated frames, which is not efficient. Temporal DropGraph can provide efficient regularization when K = 5, 10, 20. When K = 30, the regularization is too strong. We use this setting for NTU-RGBD and NTU-RGBD-120. For NW-UCLA, we set temporal K = 5 due to the small number of frames in NW-UCLA action samples.
What problem does this paper attempt to address?