Abstract:In cross-view action recognition, "what you saw" in one view is different from "what you recognize" in another view. The data distribution even the feature space can change from one view to another due to the appearance and motion of actions drastically vary across different views. In this paper, we address the problem of transferring action models learned in one view (source view) to another different view (target view), where action instances from these two views are represented by heterogeneous features. A novel learning method, called Heterogeneous Transfer Discriminantanalysis of Canonical Correlations (HTDCC), is proposed to learn a discriminative common feature space for linking source and target views to transfer knowledge between them. Two projection matrices that respectively map data from source and target views into the common space are optimized via simultaneously minimizing the canonical correlations of inter-class samples and maximizing the intraclass canonical correlations. Our model is neither restricted to corresponding action instances in the two views nor restricted to the same type of feature, and can handle only a few or even no labeled samples available in the target view. To reduce the data distribution mismatch between the source and target views in the common feature space, a nonparametric criterion is included in the objective function. We additionally propose a joint weight learning method to fuse multiple source-view action classifiers for recognition in the target view. Different combination weights are assigned to different source views, with each weight presenting how contributive the corresponding source view is to the target view. The proposed method is evaluated on the IXMAS multi-view dataset and achieves promising results.

Cross-View Action Recognition Based on Hierarchical View-Shared Dictionary Learning.

Cross-View Action Recognition Via Dual-Codebook and Hierarchical Transfer Framework

Hierarchically Learned View-Invariant Representations for Cross-View Action Recognition

Cross-View Action Recognition over Heterogeneous Feature Spaces

Cross-view action recognition via view knowledge transfer

Cross-view Action Recognition by Cross-Domain Learning.

Joint Transferable Dictionary Learning and View Adaptation for Multi-view Human Action Recognition.

Deeply Learned View-Invariant Features for Cross-View Action Recognition

Cross-view Action Recognition Via Transductive Transfer Learning

View-invariant Human Action Recognition Via Robust Locally Adaptive Multi-View Learning

Cross-modality Online Distillation for Multi-View Action Recognition

View-invariant feature discovering for multi-camera human action recognition

Discriminative virtual views for cross-view action recognition

Multi-layer Representation for Cross-view Action Recognition

Cross-View Action Recognition Via a Continuous Virtual Path

Weakly supervised cross-view action recognition via sequential motion accumulation

Shifting Perspective to See Difference: A Novel Multi-View Method for Skeleton Based Action Recognition

A Cross View Learning Approach for Skeleton-Based Action Recognition

Multiple Continuous Virtual Paths Based Cross-View Action Recognition.

Cross-view Action Recognition Understanding From Exocentric to Egocentric Perspective

Discriminative Multi-View Dynamic Image Fusion for Cross-View 3-D Action Recognition