Overcomplete graph convolutional denoising autoencoder for noisy skeleton action recognition

Jiajun Guo, Qingge Ji, Guangwei Shan
DOI: https://doi.org/10.1049/ipr2.12944
IF: 2.3
2023-10-07
IET Image Processing
Abstract: To deal with incomplete and noisy skeletons in real-world action recognition, an overcomplete Graph Convolutional Denoising Autoencoder (GCDAE) is proposed. Its overcomplete and fully graph convolutional structure allows it to rectify noisy joints while preserving unspoiled details, so it can be combined with any pretrained recognition backbone to improve that backbone's robustness efficiently. Current skeleton-based action recognition methods usually assume that the input skeleton is complete and noise-free. In practice, however, captured skeletons are inevitably incomplete due to occlusions or noisy due to changes in the environment. When dealing with such data, even state-of-the-art (SOTA) recognition backbones suffer significant degradation in recognition accuracy. Although a few methods have been proposed to address this issue, they still lack flexibility, efficiency and interpretability. In this work, the proposed GCDAE acts as a flexible preprocessing module for pretrained recognition backbones and improves their robustness. Taking advantage of the overcomplete and fully graph convolutional structure, GCDAE is able to rectify noisy joints while efficiently retaining the information in unspoiled details. On two large-scale skeleton datasets, NTU RGB+D 60 and NTU RGB+D 120, introducing GCDAE brings significant robustness improvements to SOTA backbones against different types of noise.
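The abstract gives no layer-level detail, so the following is only a minimal sketch of what "overcomplete" and "fully graph convolutional" could mean in code. It assumes a standard symmetric-normalized graph convolution, a single hidden layer that is wider than the 3-D joint input, and random untrained weights. The class name TinyGCDAE, the 5-joint chain skeleton and all layer sizes are hypothetical illustrations, not the authors' architecture.

```python
import random


def normalized_adjacency(num_joints, bones):
    """Return D^-1/2 (A + I) D^-1/2 for an undirected skeleton graph."""
    a = [[1.0 if i == j else 0.0 for j in range(num_joints)]
         for i in range(num_joints)]
    for i, j in bones:
        a[i][j] = a[j][i] = 1.0
    deg = [sum(row) for row in a]
    return [[a[i][j] / (deg[i] * deg[j]) ** 0.5 for j in range(num_joints)]
            for i in range(num_joints)]


def matmul(x, y):
    """Plain-Python matrix product of two nested lists."""
    return [[sum(x[i][k] * y[k][j] for k in range(len(y)))
             for j in range(len(y[0]))] for i in range(len(x))]


def graph_conv(a_hat, feats, weight, relu=True):
    """One graph convolution: aggregate over neighbours, then mix channels."""
    out = matmul(matmul(a_hat, feats), weight)
    if relu:
        out = [[max(0.0, v) for v in row] for row in out]
    return out


class TinyGCDAE:
    """Hypothetical two-layer sketch, NOT the architecture from the paper."""

    def __init__(self, a_hat, in_ch=3, hidden_ch=16, seed=0):
        rng = random.Random(seed)
        self.a_hat = a_hat
        # Overcomplete: the hidden width (16) exceeds the input width (3).
        self.w_enc = [[rng.uniform(-0.5, 0.5) for _ in range(hidden_ch)]
                      for _ in range(in_ch)]
        self.w_dec = [[rng.uniform(-0.5, 0.5) for _ in range(in_ch)]
                      for _ in range(hidden_ch)]

    def encode(self, joints):
        return graph_conv(self.a_hat, joints, self.w_enc)

    def forward(self, joints):
        # The output has the same shape as the input, so it can be passed
        # unchanged to a pretrained recognition backbone.
        return graph_conv(self.a_hat, self.encode(joints), self.w_dec,
                          relu=False)


# Assumed toy skeleton: 5 joints in a chain, with joint 2 zeroed ("occluded").
bones = [(0, 1), (1, 2), (2, 3), (3, 4)]
a_hat = normalized_adjacency(5, bones)
model = TinyGCDAE(a_hat)
noisy = [[0.0, 0.0, 0.0], [0.1, 0.5, 0.0], [0.0, 0.0, 0.0],
         [0.3, 1.5, 0.1], [0.4, 2.0, 0.1]]
hidden = model.encode(noisy)
restored = model.forward(noisy)
```

Because the weights here are untrained, the values in restored carry no meaning; the sketch only shows the data flow. A real denoiser of this kind would be trained to map corrupted skeletons back to clean ones before being placed in front of a frozen backbone.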
computer science, artificial intelligence; engineering, electrical & electronic; imaging science & photographic technology