Combining Link And Content Correlation Learning For Cross-Modal Retrieval In Social Multimedia

Longtao Zhang,Fangfang Liu,Zhimin Zeng
DOI: https://doi.org/10.1007/978-3-319-74521-3_54
2018-01-01
Abstract:With the rapid growth of multimedia data, cross-modal retrieval has received great attention. Generally, learning semantics correlation is the primary solution for eliminating heterogeneous gap between modalities. Existing approaches usually focus on modeling cross-modal correlation and category correlation, which can't capture semantic correlation thoroughly for social multimedia data. In fact, the diverse link information is complementary to provide rich hints for semantic correlation. In this paper, we propose a novel cross-modal correlation learning approach based on subspace learning by taking heterogeneous social link and content information into account. Both intra-modal and inter-modal correlation are simultaneously considered through explicitly modeling link information. Additionally, those correlations are incorporated into final representation, which further improve the performance of cross modal retrieval effectively. Experimental results demonstrate that the proposed approach performs better comparing with several state-of-the-art cross-modal correlation learning approaches.
What problem does this paper attempt to address?