Abstract:Cross-media retrieval of scientific and technological information is one of the important tasks in the cross-media study. Cross-media scientific and technological information retrieval obtain target information from massive multi-source and heterogeneous scientific and technological resources, which helps to design applications that meet users' needs, including scientific and technological information recommendation, personalized scientific and technological information retrieval, etc. The core of cross-media retrieval is to learn a common subspace, so that data from different media can be directly compared with each other after being mapped into this subspace. In subspace learning, existing methods often focus on modeling the discrimination of intra-media data and the invariance of inter-media data after mapping; however, they ignore the semantic consistency of inter-media data before and after mapping and media discrimination of intra-semantics data, which limit the result of cross-media retrieval. In light of this, we propose a scientific and technological information oriented Semantics-adversarial and Media-adversarial Cross-media Retrieval method (SMCR) to find an effective common subspace. Specifically, SMCR minimizes the loss of inter-media semantic consistency in addition to modeling intra-media semantic discrimination, to preserve semantic similarity before and after mapping. Furthermore, SMCR constructs a basic feature mapping network and a refined feature mapping network to jointly minimize the media discriminative loss within semantics, so as to enhance the feature mapping network's ability to confuse the media discriminant network. Experimental results on two datasets demonstrate that the proposed SMCR outperforms state-of-the-art methods in cross-media retrieval.

Learning a Semantic Space for Modeling Images, Tags and Feelings in Cross-Media Search.

A Benchmark Dataset and Learning High-Level Semantic Embeddings of Multimedia for Cross-Media Retrieval.

Learning A Semantic Space from User'S Relevance Feedback for Image Retrieval

Facebook5k: A Novel Evaluation Resource Dataset for Cross-Media Search

Semantic Consistency Hashing for Cross-Modal Retrieval

Multiple Kernel Visual-Auditory Representation Learning for Retrieval

Crossmedia retrieval by learning rich semantic embeddings of multimedia

Learning Semantic Correlations for Cross-Media Retrieval.

Image Retrieval Based on Fuzzy Semantic Relevance Matrix

Cross-Media Retrieval: Concepts, Advances And Challenges

Cross-Modal Image-Text Retrieval with Semantic Consistency

Modeling Image Data for Effective Indexing and Retrieval in Large General Image Databases.

Measuring the Semantic Relatedness Between Images Using Social Tags.

A Deep Semantic Alignment Network for the Cross-Modal Image-Text Retrieval in Remote Sensing

Cross-Domain Feature Learning in Multimedia

Measuring semantic relatedness between Flickr images: from a social tag based view.

Sampled Image Tagging and Retrieval Methods on User Generated Content

Cross-View Feature Learning for Scalable Social Image Analysis.

Cross-Modal Image-Tag Relevance Learning for Social Images

Cross-media semantic representation via bi-directional learning to rank.

Scientific and Technological Information Oriented Semantics-adversarial and Media-adversarial Cross-media Retrieval