Learning Descriptors with Cube Loss for View-Based 3-D Object Retrieval

Dong Wang,Hongxun Yao,Federico Tombari,Sicheng Zhao,Bin Wang,Hong Liu
DOI: https://doi.org/10.1109/tmm.2019.2892004
IF: 7.3
2019-01-01
IEEE Transactions on Multimedia
Abstract:3-D object retrieval has been a hot research topic in recent years. Within such a field, view-based approaches are attracting increasing attention because of the flexibility of data representation as well as the reported state-of-the-art performance. One of the most important issues related to view-based 3-D object retrieval is how to learn embedding features that are discriminative across classes while being compactly distributed within each class. In this paper, we analyze the difference between the two tasks of classification and retrieval, and propose a novel way to learn a view-pooling feature via a triplet network. In addition, we propose a new loss, named cube loss, which is able to sample a number of triplets equal to the cube of the samples in a batch. With the new loss, both hard-negative and hard-positive pairs can be effectively investigated. The experimental results on the ModelNet benchmark demonstrate that the proposed method achieves superior performance compared to state-of-the-art approaches.
What problem does this paper attempt to address?