Audio Feature Learning with Triplet-Based Embedding Network.

Xiaoyu Qi,Deshun Yang,Xiaoou Chen
DOI: https://doi.org/10.1609/aaai.v31i1.11071
2017-01-01
Proceedings of the AAAI Conference on Artificial Intelligence
Abstract:We propose a triplet-based network for audio feature learning for version identification. Existing methods use hand-crafted features for a music as a whole while we learn features by a triplet-based neural network on segment-level, focusing on the most similar parts between music versions. We conduct extensive experiments and demonstrate our merits.
What problem does this paper attempt to address?