Consumer Video Understanding

Yu-Gang Jiang,Guangnan Ye,Shih-Fu Chang,Daniel Ellis,Alexander C. Loui
DOI: https://doi.org/10.1145/1991996.1992025
2011-01-01
Abstract:Recognizing visual content in unconstrained videos has become a very important problem for many applications. Existing corpora for video analysis lack scale and/or content diversity, and thus limited the needed progress in this critical area. In this paper, we describe and release a new database called CCV, containing 9,317 web videos over 20 semantic categories, including events like "baseball" and "parade", scenes like "beach", and objects like "cat". The database was collected with extra care to ensure relevance to consumer interest and originality of video content without post-editing. Such videos typically have very little textual annotation and thus can benefit from the development of automatic content analysis techniques.
What problem does this paper attempt to address?