Abstract: Learning an effective distance measurement between sample pairs plays an important role in visual analysis, where the training procedure largely relies on hard negative samples. However, hard negative samples usually account for only a tiny minority of the training set, which may fail to fully describe the data distribution close to the decision boundary. In this paper, we present a deep adversarial metric learning (DAML) framework to generate synthetic hard negatives from the original negative samples, which is widely applicable to existing supervised deep metric learning algorithms. Unlike existing sampling strategies, which simply ignore numerous easy negatives, our DAML aims to exploit them by generating synthetic hard negatives adversarial to the learned metric as complements. We simultaneously train the feature embedding and the hard negative generator in an adversarial manner, so that adequate and targeted synthetic hard negatives are created to learn more precise distance metrics. As a single transformation may not be powerful enough to describe the global input space under the attack of the hard negative generator, we further propose a deep adversarial multi-metric learning (DAMML) method that learns multiple local transformations for a more complete description. We simultaneously exploit the collaborative and competitive relationships among multiple metrics, where the metrics display unity against the generator for effective distance measurement and also compete for more training data through a metric discriminator to avoid overlapping. Extensive experimental results on five benchmark datasets show that our DAML and DAMML effectively boost the performance of existing deep metric learning approaches through adversarial learning.
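
The role of hard negatives described in the abstract can be illustrated with a minimal sketch. This is not the DAML generator: it assumes a plain triplet loss on raw 2-D points and stands in for the learned generator with a hypothetical linear interpolation toward the anchor. The names `triplet_loss`, `synthesize_hard_negative`, `margin`, and `step` are illustrative assumptions and do not come from the paper.

```python
def squared_distance(u, v):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((a - b) ** 2 for a, b in zip(u, v))


def triplet_loss(anchor, positive, negative, margin=1.0):
    """Hinge-style triplet loss: zero once the negative is far enough away."""
    d_pos = squared_distance(anchor, positive)
    d_neg = squared_distance(anchor, negative)
    return max(0.0, d_pos - d_neg + margin)


def synthesize_hard_negative(anchor, negative, step=0.5):
    """Hypothetical stand-in for a learned generator (assumption, not DAML):
    move the negative a fraction `step` of the way toward the anchor."""
    return [n + step * (a - n) for a, n in zip(anchor, negative)]


anchor = [0.0, 0.0]
positive = [1.0, 0.0]
easy_negative = [4.0, 0.0]

# The easy negative already satisfies the margin, so it yields no training signal.
loss_easy = triplet_loss(anchor, positive, easy_negative)

# The synthetic negative lies closer to the anchor and produces a non-zero loss.
hard_negative = synthesize_hard_negative(anchor, easy_negative, step=0.75)
loss_hard = triplet_loss(anchor, positive, hard_negative)

print(loss_easy, hard_negative, loss_hard)
```

In this toy setting the easy negative contributes zero loss, while the interpolated one violates the margin. In the framework described above the generator is instead trained adversarially against the learned metric, which this sketch does not attempt to reproduce.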

Deep Metric Learning For The Target Cost In Unit-Selection Speech Synthesizer

A data driven method for target and concatenation cost calculation with KL-Divergence in Mandarin hybrid speech synthesis

Progressive Neural Networks Based Features Prediction for the Target Cost in Unit-Selection Speech Synthesizer

DNN-based unit selection using frame-sized speech segments

DMDSpeech: Distilled Diffusion Model Surpassing The Teacher in Zero-shot Speech Synthesis via Direct Metric Optimization

Extracting Unit Embeddings Using Sequence-To-Sequence Acoustic Models for Unit Selection Speech Synthesis

HMM-based Unit Selection Speech Synthesis Using Log Likelihood Ratios Derived from Perceptual Data

Learning and Modeling Unit Embeddings for Improving HMM-based Unit Selection Speech Synthesis

Learning and Modeling Unit Embeddings Using Deep Neural Networks for Unit-Selection-Based Mandarin Speech Synthesis

Deep Metric Learning Via Adaptive Learnable Assessment

Unit Selection Speech Synthesis Using Frame-Sized Speech Segments and Neural Network Based Acoustic Models

A novel unit selection method for concatenation speech system using similarity measure

Improve distance metric learning by learning positions of class centers

Distance-Dependent Metric Learning

Scalable Angular Discriminative Deep Metric Learning for Face Recognition

Deep Adversarial Metric Learning

Deep Factorized Metric Learning

Cost-Sensitive Deep Metric Learning for Fine-Grained Image Classification

Metric Learning for Keyword Spotting

Anchor-aware Deep Metric Learning for Audio-visual Retrieval

Perceptual Clustering Based Unit Selection Optimization for Concatenative Text-to-speech Synthesis