Abstract:Multimedia event detection (MED) plays an important role in many applications such as video indexing and retrieval. Current event detection works mainly focus on sports and news event detection or abnormality detection in surveillance videos. Differently, our research aims to detect more complicated and generic events within a longer video sequence. In the past, researchers have proposed using intermediate concept classifiers with concept lexica to help understand the videos. Yet it is difficult to judge how many and what concepts would be sufficient for the particular video analysis task. Additionally, obtaining robust semantic concept classifiers requires a large number of positive training examples, which in turn has high human annotation cost. In this paper, we propose an approach that exploits the external concepts-based videos and event-based videos simultaneously to learn an intermediate representation from video features. Our algorithm integrates the classifier inference and latent intermediate representation into a joint framework. The joint optimization of the intermediate representation and the classifier makes them mutually beneficial and reciprocal. Effectively, the intermediate representation and the classifier are tightly correlated. The classifier dependent intermediate representation not only accurately reflects the task semantics but is also more suitable for the specific classifier. Thus we have created a discriminative semantic analysis framework based on a tightly coupled intermediate representation. Extensive experiments on multimedia event detection using real-world videos demonstrate the effectiveness of the proposed approach.

Interactive Surveillance Event Detection through Mid-level Discriminative Representation

Foreground Gating and Background Refining Network for Surveillance Object Detection

Intelligent Video Surveillance for Checking Attendance of Traffic Controllers in Level Crossing

A Novel Method For Real-Time Object Detection And Multiple Persons Tracking

Real-Time Target Detection and Recognition with Deep Convolutional Networks for Intelligent Visual Surveillance

PKU-NEC @TRECVID2012 SED - Uneven-Sequence Based Event Detection in Surveillance Video.

A System Based On Sequence Learning For Event Detection In Surveillance Video

A Framework for an Event Driven Video Surveillance System

Esur: A System for Events Detection in Surveillance Video

Semi-supervised Early Event Detection.

A Representative-Based Framework For Parsing And Summarizing Events In Surveillance Videos

Experiential Sampling for Video Surveillance

Training-free Monocular 3D Event Detection System for Traffic Surveillance

Multi-object Events Recognition from Video Sequences Using Extended Finite State Machine

Event Composition with Imperfect Information for Bus Surveillance

Bi-Level Semantic Representation Analysis for Multimedia Event Detection

Effective video event detection via subspace projection

A Novel Event-Oriented Segment-Of-Interest Discovery Method For Surveillance Video

PKU@TRECVID2010: Pair-Wise Event Detection in Surveillance Video.

Multimedia Event Detection Using A Classifier-Specific Intermediate Representation

Event-based Large Scale Surveillance Video Summarization.