Semantically Supervised Maximal Correlation for Cross-Modal Retrieval

Mingyang Li,Yang Li,Shao-Lun Huang,Lin Zhang
DOI: https://doi.org/10.1109/icip40778.2020.9190873
2020-01-01
Abstract:With the rapid growth of multimedia data, the cross-modal retrieval problem has attracted a lot of interest in both research and industry in recent years. However, the inconsistency of data distribution from different modalities makes such task challenging. In this paper, we propose Semantically Supervised Maximal Correlation (S2MC) method for cross-modal retrieval by incorporating semantic label information into the traditional maximal correlation framework. Combining with maximal correlation based method for extracting unsupervised pairing information, our method effectively exploits supervised semantic information on both common feature space and label space. Extensive experiments show that our method outperforms other current state-of-the-art methods on cross-modal retrieval tasks on three widely used datasets.
What problem does this paper attempt to address?