Person-Based Video Summarization And Retrieval By Tracking And Clustering Temporal Face Sequences

Tong Zhang,Di Wen,Xiaoqing Ding
DOI: https://doi.org/10.1117/12.2009127
2013-01-01
Abstract:People are often the most important subjects in videos. It is highly desired to automatically summarize the occurrences of different people in a large collection of video and quickly find the video clips containing a particular person among them. In this paper, we present a person-based video summarization and retrieval system named VideoWho which extracts temporal face sequences in videos and groups them into clusters, with each cluster containing video clips of the same person. This is accomplished based on advanced face detection and tracking algorithms, together with a semi-supervised face clustering approach. The system achieved good clustering accuracy when tested on a hybrid video set including home video, TV plays and movies. On top of this technology, a number of applications can be built, such as automatic summarization of major characters in videos, person-related video search on the Internet and personalized UI systems etc.
What problem does this paper attempt to address?