Abstract:Tennis has becoming an increasingly popular sport throughout the world. Tennis motion recognition based on 3D video has attracted more and more attention in recent years. The algorithm based on dynamic time warping takes into account the timing sequence information of movements and can solve the uncertainty of human movement at temporal level. By increasing the training samples, the efficiency will decrease accordingly. This work presents a tennis action recognition framework based on action standard sequence. The 3D action video samples are incorporated into action sequences by feature extraction, wherein the action standard sequences are encoded as a sequence averaging optimization problem under the dynamic time normalization metric. The dynamic time normalization barycenter averaging algorithm (DBA) is leveraged to solve this problem. For the tennis scenery with significant differences in the action categories, we study the standard sequence learning of multiple actions, and accordingly propose a DBA-K-means clustering algorithm for unsupervised learning. Herein, a human tennis action recognition by integrating feature optimization and image similarity is proposed. The three dimensional reduction methods, including principal component analysis (PCA),PCA + Pearson, and PCA+ Spearman, were compared to prove that PCA+ Pearson correlation coefficient had the best dimensional reduction effect. Meanwhile, the global feature eight-star model is combined with the local feature HOG feature after dimensionally reduced to fully represent human movements. The similarity between pairwise adjacent frames of images was calculated. The statistical weight of single frame SVM classification results within a discriminant period is adaptively allocated, and finally the body pose recognition results are classified twice. Experiments on standard data set KTH show that the recognition accuracy of this algorithm is 94.5%, which is better than other methods. It has a good application value in the field of video human motion recognition. Also, we have demonstrated that this method can further improve the efficiency and accuracy of action recognition. Effective feature extraction is beneficial to improve the accuracy of subsequent human action recognition.

Relative Boundary Modeling: A High-Resolution Cricket Bowl Release Detection Framework with I3D Features

STAN: Spatial-Temporal Awareness Network for Temporal Action Detection

3D-SSD: Learning Hierarchical Features from RGB-D Images for Amodal 3D Object Detection

TriDet: Temporal Action Detection with Relative Boundary Modeling

Efficient 3D Position Estimation in Badminton Scene.

Deep Spatial/temporal-level feature engineering for Tennis-based action recognition

Deep-Learning-Based Computer Vision Approach For The Segmentation Of Ball Deliveries And Tracking In Cricket

Pose Estimation for Swimmers in Video Surveillance

Computational Analysis of Table Tennis Matches from Real-Time Videos Using Deep Learning.

Cricket stroke extraction: Towards creation of a large-scale cricket actions dataset

Multi-camera Temporal Grouping for Play/Break Event Detection in Soccer Games

Motion-aware and data-independent model based multi-view 3D pose refinement for volleyball spike analysis

Boundary Discretization and Reliable Classification Network for Temporal Action Detection

Multi-Level Content-Aware Boundary Detection for Temporal Action Proposal Generation

Automated recognition of the cricket batting backlift technique in video footage using deep learning architectures

Application of deep learning in automatic detection of technical and tactical indicators of table tennis

Enhanced Sports Video Shot Boundary Detection Based on Middle Level Features and a Unified Model

Distribution-Aware Activity Boundary Representation for Online Detection of Action Start in Untrimmed Videos

A CNN-based approach to classify cricket bowlers based on their bowling actions

Sports Field Registration Via Keypoints-aware Label Condition

CASRM: Cricket Automation and Stroke Recognition Model Using OpenPose