Weakly-paired Deep Dictionary Learning for Cross-Modal Retrieval

Huaping Liu,Feng Wang,Xinyu Zhang,Fuchun Sun
DOI: https://doi.org/10.1016/j.patrec.2018.06.021
IF: 4.757
2018-01-01
Pattern Recognition Letters
Abstract:Many multi-modal data suffers from significant weak-pairing characteristics, i.e., there is no sample-to-sample correspondence between modalities, rather classes of samples in one modality correspond to classes of samples in the other modality. This provides great challenges for the cross-modal learning for retrieval. In this work, our focus is learning cross-modal representations with minimal class label supervision and without correspondences between samples. To tackle this challenging problem, we establish a scalable hierarchical learning architecture to deal with the extensive weakly-paired heterogeneous multi-modal data. A shared classifier across different modalities is used to effectively deal with label supervision information, and a multi-modal low-rank model is introduced to encourage the modal-invariant representation. Finally, some cross-modal validations on publicly available datasets are performed to show the advantages of the proposed method. (C) 2018 Elsevier B.V. All rights reserved.
What problem does this paper attempt to address?