Abstract:The growing rate of public space CCTV installations has generated a need for automated methods for exploiting video surveillance data including scene understanding, query, behaviour annotation and summarization. For this reason, extensive research has been performed on surveillance scene understanding and analysis. However, most studies have considered single scenes, or groups of adjacent scenes. The semantic similarity between different but related scenes (e.g., many different traffic scenes of similar layout) is not generally exploited to improve any automated surveillance tasks and reduce manual effort. Exploiting commonality, and sharing any supervised annotations, between different scenes is however challenging due to: Some scenes are totally un-related -- and thus any information sharing between them would be detrimental; while others may only share a subset of common activities -- and thus information sharing is only useful if it is selective. Moreover, semantically similar activities which should be modelled together and shared across scenes may have quite different pixel-level appearance in each scene. To address these issues we develop a new framework for distributed multiple-scene global understanding that clusters surveillance scenes by their ability to explain each other's behaviours; and further discovers which subset of activities are shared versus scene-specific within each cluster. We show how to use this structured representation of multiple scenes to improve common surveillance tasks including scene activity understanding, cross-scene query-by-example, behaviour classification with reduced supervised labelling requirements, and video summarization. In each case we demonstrate how our multi-scene model improves on a collection of standard single scene models and a flat model of all scenes.

Video Structural Description: A Semantic Based Model for Representing and Organizing Video Surveillance Big Data

Semantic based representing and organizing surveillance big data using video structural description technology

Video structural description technology for the new generation video surveillance systems

Hierarchical Video Data Modeling and Indexing for Virtual Scene Construction

Semantic enhanced cloud environment for surveillance data management using video structural description

Crowd Sensing Based Semantic Annotation of Surveillance Videos.

&Lt;title>automatic Traffic Real-Time Analysis System Based on Video</title>

The Big Data Analytics and Applications of the Surveillance System Using Video Structured Description Technology.

Hierarchical organization for medical video summarization using latent visual and semantic analysis

From Time to Space: Automatic Annotation of Unmarked Traffic Scene Based on Trajectory Data.

Semantic-based surveillance video retrieval

Event-based Large Scale Surveillance Video Summarization.

Discovery of Shared Semantic Spaces for Multi-Scene Video Query and Summarization

Video Semantic Models : Survey and Evaluation

Construction and Application of Video Big Data Analysis Platform for Smart City Development

A Statistics-Based Method For Video Semantic Analysis

AN ADAPTIVE ORGANIZATION METHOD OF GEOVIDEO DATA FOR SPATIO-TEMPORAL ASSOCIATION ANALYSIS

Video Data Mining: Semantic Indexing and Event Detection from the Association Perspective

Mining Semantic Context Information for Intelligent Video Surveillance of Traffic Scenes

Semantic Link Network-Based Model for Organizing Multimedia Big Data

A Representative-Based Framework For Parsing And Summarizing Events In Surveillance Videos