PAGML: Precise Alignment Guided Metric Learning for sketch-based 3D shape retrieval
Shaojin Bai, Jing Bai, Hao Xu, Jiwen Tuo, Min Liu
DOI: https://doi.org/10.1016/j.imavis.2023.104756
IF: 3.86
2023-07-03
Image and Vision Computing
Abstract: Sketch-based 3D shape retrieval has long been an active research topic in the computer vision community. The main challenge is to alleviate the cross-modality discrepancies so that retrieval accuracy can be improved. In this paper, we propose a novel Precise Alignment Guided Metric Learning (PAGML) method based on a master-auxiliary cross-modality retrieval framework. An auxiliary learning network is developed to indirectly guide the master learning model to extract features rich in semantic information, so as to achieve semantic alignment between the cross-modality data. Furthermore, because of intra-class variability and inter-class imbalance, the learned class distributions may be uneven in the common embedding space and cause poor retrieval performance. A loss function dedicated to cross-modality retrieval is designed to achieve a rigid alignment between sketches and 3D shapes of the same category by pulling their rich semantic representations to the rigid center of the category. As a result, a more precise alignment between the cross-modality embedding features of the same category is gradually approached, which further alleviates the cross-modality discrepancies, intra-class variability, and inter-class imbalance, thus improving cross-modality retrieval accuracy. Extensive experiments on two public benchmark datasets demonstrate that the proposed PAGML surpasses state-of-the-art methods in retrieval accuracy and generalizes well to unseen classes.
computer science, artificial intelligence, theory & methods, engineering, electrical & electronic, software engineering, optics
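
The abstract describes a loss that pulls sketch and 3D shape embeddings of the same category toward a rigid center of that category, but it does not give the formula. The following is only a minimal sketch of that idea under stated assumptions: the centers are fixed, mutually orthogonal vectors (scaled one-hot rows), and the penalty is the mean squared Euclidean distance to the assigned center. The function names `make_fixed_centers` and `center_alignment_loss` are hypothetical and are not taken from the paper.

```python
import numpy as np


def make_fixed_centers(num_classes, dim, scale=1.0):
    """Assumed choice of 'rigid' centers: fixed, mutually orthogonal vectors.

    Each class c gets the row `scale * e_c`, which is never updated in training.
    """
    if dim < num_classes:
        raise ValueError("dim must be >= num_classes for orthogonal centers")
    return scale * np.eye(num_classes, dim)


def center_alignment_loss(sketch_emb, sketch_labels, shape_emb, shape_labels, centers):
    """Mean squared distance of every embedding to the fixed center of its class.

    Both modalities share the same centers, so minimizing this value pulls
    same-category sketches and 3D shapes toward one common point.
    """
    emb = np.concatenate([sketch_emb, shape_emb], axis=0)
    labels = np.concatenate([sketch_labels, shape_labels], axis=0)
    diffs = emb - centers[labels]  # per-sample offset from its class center
    return float(np.mean(np.sum(diffs ** 2, axis=1)))


# Toy usage: 3 classes in a 4-dimensional embedding space.
centers = make_fixed_centers(num_classes=3, dim=4)
labels = np.array([0, 1])
shape_emb = centers[labels]          # shapes sit exactly on their centers
sketch_emb = centers[labels] + 1.0   # sketches are offset by 1 in every dimension
loss = center_alignment_loss(sketch_emb, labels, shape_emb, labels, centers)
```

Because the centers are shared across modalities and held fixed in this sketch, the loss reaches zero only when every sketch and every 3D shape of a category coincides with that category's center, which is one plausible reading of the "rigid alignment" the abstract mentions.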