Abstract:Various innovative and original works have been applied and proposed in the field of sports video analysis. However, individual works have focused on sophisticated methodologies with particular sport types and there has been a lack of scalable and holistic frameworks in this field. This article proposes a solution and presents a systematic and generic approach which is experimented on a relatively large-scale sports consortia. The system aims at the event detection scenario of an input video with an orderly sequential process. Initially, domain knowledge-independent local descriptors are extracted homogeneously from the input video sequence. Then the video representation is created by adopting a bag-of-visual-words (BoW) model. The video’s genre is first identified by applying the k-nearest neighbor (k-NN) classifiers on the initially obtained video representation, and various dissimilarity measures are assessed and evaluated analytically. Subsequently, an unsupervised probabilistic latent semantic analysis (PLSA)-based approach is employed at the same histogram-based video representation, characterizing each frame of video sequence into one of four view groups, namely closed-up-view, mid-view, long-view, and outer-field-view. Finally, a hidden conditional random field (HCRF) structured prediction model is utilized for interesting event detection. From experimental results, k-NN classifier using KL-divergence measurement demonstrates the best accuracy at 82.16&percnt; for genre categorization. Supervised SVM and unsupervised PLSA have average classification accuracies at 82.86&percnt; and 68.13&percnt;, respectively. The HCRF model achieves 92.31&percnt; accuracy using the unsupervised PLSA based label input, which is comparable with the supervised SVM based input at an accuracy of 93.08&percnt;. In general, such a systematic approach can be widely applied in processing massive videos generically.

A Unified Framework for Semantic Shot Representation of Sports Video

A Unified Framework for Semantic Shot Classification in Sports Videos

Semantic Shot Classification in Sports Video

A mid-level representation framework for semantic sports video analysis.

A semantic description scheme of soccer video based on MPEG-7

A Generic Approach to Classify Sports Video Shots and Its Application in Event Detection

Video Content Representation for Shot Retrieval and Scene Extraction.

A lightweight weak semantic framework for cinematographic shot classification

Statistical Framework For Shot Segmentation And Classification In Sports Video

Shot Classification of Sports Video Based on Features in Motion Vector Field

A Unified Framework for Shot Type Classification Based on Subject Centric Lens

A Mid-Level Visual Concept Generation Framework for Sports Analysis

A Fusion Scheme of Visual and Auditory Modalities for Event Detection in Sports Video.

Semantic Video Shot Segmentation Based on Color Ratio Feature and SVM

Shot Content Analysis for Video Retrieval Applications

Content-based Sports Video Analysis and Modeling

Semantic Event Extraction From Basketball Games Using Multi-Modal Analysis

A Compact Shot Representation for Video Semantic Indexing

Enhanced Sports Video Shot Boundary Detection Based on Middle Level Features and a Unified Model

Shot classification and replay detection for sports video summarization

A Generic Approach for Systematic Analysis of Sports Videos