Guest Editorial: Ad Hoc Web Multimedia Analysis with Limited Supervision

Yahong Han,Yi Yang,Jingdong Wang
DOI: https://doi.org/10.1007/s11042-014-2419-y
IF: 2.577
2015-01-01
Multimedia Tools and Applications
Abstract:With the popularity of social media applications and Web 2.0 techniques, there is an explosive growth of web multimedia data generated from user-sharing web sites such as Flickr, YouTube, and Facebook. The social characteristics and the increased scalability turn out to be a great challenge in the semantic understanding and retrieval of web multimedia. Though the users’ comments and tagging can be well exploited to provide more semantic cues for the web multimedia analysis, the annotations of these data contain a lot of noisy tags and are always weakly tagging. Thus, the supervision information available is limited due to the huge output space. Furthermore, for web multimedia analysis, the negative examples come from an infinite semantic space and we have no clue about the semantics these negative examples include. Thus, how to construct the generic model for each ad hoc multimedia analysis task (a.k.a. AdHocWeb Multimedia Analysis) is a challenging problem for the social media applications on the web. The research of ad hoc web multimedia analysis with limited supervision is an interesting and fundamental research area, which involves several fields, ranging from machine learning, multimedia retrieval, and computer vision to data mining. This issue consists of 11 papers, which are briefly discussed as follows. Visual content analysis is a fundamental problem in ad hoc multimedia analysis, which will facilitate the semantic understanding of multimedia data. In this issue, two papers investigate visual content analysis and its applications in super-resolution and image-based localization on mobile phone. The “Depth map Super-Resolution based on joint dictionary learning” (10.1007/s11042-014-2002-6) paper utilizes unsupervised dictionary learning for depth map super-resolution. This method transforms a low-resolution depth map to a high-resolution depth map. Different from previous depth map super-resolution methods, the proposed algorithm uses a joint dictionary learning method with both lowand high-resolution depth Multimed Tools Appl (2015) 74:463–465 DOI 10.1007/s11042-014-2419-y
What problem does this paper attempt to address?