Abstract: Classifiers for primitive visual concepts such as "car" and "sky" are well developed and widely used to support video search on simple queries. However, they are usually ineffective for complex queries such as "one or more people at a table or desk with a computer visible", because such queries carry semantics that are far more complex than, and different from, a simple aggregation of the meanings of their constituent primitive concepts. To facilitate video search with complex queries, we propose a higher-level semantic descriptor named "concept bundle", which integrates multiple primitive concepts, such as "(soccer, fighting)" and "(lion, hunting, zebra)", to describe the visual representation of complex semantics. The proposed approach first automatically selects informative concept bundles. It then builds a novel concept bundle classifier based on multi-task learning by exploiting the relatedness between a concept bundle and its primitive concepts. To model a complex query, it applies an optimal selection strategy that selects related primitive concepts and concept bundles by considering both their classifier performance and their semantic relatedness to the query. The final results are generated by fusing the individual results from the selected primitive concepts and concept bundles. Extensive experiments are conducted on two video datasets: TRECVID 2008 and YouTube. The experimental results indicate that (a) our concept bundle learning approach outperforms state-of-the-art methods by at least 19% and 29% on the TRECVID 2008 and YouTube datasets, respectively; and (b) the use of concept bundles can improve search performance for complex queries by at least 37.5% on TRECVID 2008 and 52% on YouTube.
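The selection and fusion steps described in the abstract can be illustrated with a minimal sketch. The abstract does not give the selection criterion or the fusion rule, so everything below is an assumption made for illustration: the `Detector` fields, the product `performance * relatedness` used as a selection score, the top-`k` cutoff, and the weighted-average fusion are hypothetical stand-ins, not the paper's method.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class Detector:
    """A primitive concept or concept bundle classifier (hypothetical structure)."""
    name: str
    performance: float   # assumed classifier quality in [0, 1], e.g. validation AP
    relatedness: float   # assumed semantic relatedness to the query in [0, 1]

    @property
    def weight(self) -> float:
        # Assumption: combine the two criteria by a simple product.
        return self.performance * self.relatedness


def select_detectors(detectors: List[Detector], k: int) -> List[Detector]:
    """Keep the k detectors with the highest combined weight."""
    return sorted(detectors, key=lambda d: d.weight, reverse=True)[:k]


def fuse_scores(selected: List[Detector],
                scores: Dict[str, Dict[str, float]]) -> Dict[str, float]:
    """Fuse per-video detector scores by a weighted average (assumed rule)."""
    total = sum(d.weight for d in selected)
    if total == 0:
        return {}
    videos = set()
    for d in selected:
        videos.update(scores[d.name])
    return {
        v: sum(d.weight * scores[d.name].get(v, 0.0) for d in selected) / total
        for v in videos
    }


def rank_videos(fused: Dict[str, float]) -> List[str]:
    """Order videos by fused score, breaking ties by video id."""
    return sorted(fused, key=lambda v: (-fused[v], v))


# Toy example: one concept bundle and two primitive concepts.
detectors = [
    Detector("(lion, hunting, zebra)", performance=0.6, relatedness=1.0),
    Detector("lion", performance=0.8, relatedness=0.5),
    Detector("sky", performance=0.5, relatedness=0.2),
]
scores = {
    "(lion, hunting, zebra)": {"v1": 1.0, "v2": 0.0},
    "lion": {"v1": 0.0, "v2": 1.0},
    "sky": {"v1": 0.5, "v2": 0.5},
}
selected = select_detectors(detectors, k=2)
fused = fuse_scores(selected, scores)
ranking = rank_videos(fused)
```

In this toy setup the bundle detector receives the largest weight because it is both reasonably accurate and closely related to the query, so the video it scores highly is ranked first. Other combinations of the two criteria, or a learned fusion, would fit the same interface.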

Learning Structured Concept-Segments for Interactive Video Retrieval

Mapping Query to Semantic Concepts: Leveraging Semantic Indices for Automatic and Interactive Video Retrieval

Query Representation by Structured Concept Threads with Application to Interactive Video Retrieval

Utilizing Related Samples to Learn Complex Queries in Interactive Concept-Based Video Search

Semantic Video Search by Exploiting Large-Scale Visual Concepts

Utilizing Related Samples to Enhance Interactive Concept-Based Video Search

Video search in concept subspace: a text-like paradigm

The importance of query-concept-mapping for automatic video retrieval

An integrated semantic-based approach in concept based video retrieval

Interpretable Embedding for Ad-hoc Video Search

Learning Concept Bundles for Video Search with Complex Queries

Explicit and implicit concept-based video retrieval with bipartite graph propagation model

Learning a Multi-Concept Video Retrieval Model with Multiple Latent Variables

Fast And Accurate Content-Based Semantic Search In 100m Internet Videos

Interactive Video Indexing with Statistical Active Learning

Graph-based Multi-Space Semantic Correlation Propagation for Video Retrieval

Video diver: generic video indexing with diverse features

Improving Interpretable Embeddings for Ad-hoc Video Search with Generative Captions and Multi-word Concept Bank

An Effective Video Retrieval Approach Based on Multi-modality Concept Correlation Graph

Semantic Concept Learning Through Massive Internet Video Mining

Diving Into The Relations: Leveraging Semantic and Visual Structures For Video Moment Retrieval