Abstract: Understanding what information neural networks capture is an essential problem in deep learning, and studying whether different models capture similar features is an initial step to achieve this goal. Previous works sought to define metrics over the feature matrices to measure the difference between two models. However, different metrics sometimes lead to contradictory conclusions, and there has been no consensus on which metric is suitable to use in practice. In this work, we propose a novel metric that goes beyond previous approaches. Recall that one of the most practical scenarios of using the learned representations is to apply them to downstream tasks. We argue that we should design the metric based on a similar principle. For that, we introduce the transferred discrepancy (TD), a new metric that defines the difference between two representations based on their downstream-task performance. Through an asymptotic analysis, we show how TD correlates with downstream tasks and the necessity to define metrics in such a task-dependent fashion. In particular, we also show that under specific conditions, the TD metric is closely related to previous metrics. Our experiments show that TD can provide fine-grained information for varied downstream tasks, and for the models trained from different initializations, the learned features are not the same in terms of downstream-task predictions. We find that TD may also be used to evaluate the effectiveness of different training strategies. For example, we demonstrate that the models trained with proper data augmentations that improve the generalization capture more similar features in terms of TD, while those with data augmentations that hurt the generalization will not. This suggests a training strategy that leads to more robust representation also trains models that generalize better.

Distributional discrepancy: A metric for unconditional text generation

The Detection of Distributional Discrepancy for Text Generation

The detection of distributional discrepancy for language GANs

Open-Domain Text Evaluation via Contrastive Distribution Methods

Distribution Aware Metrics for Conditional Natural Language Generation

On the Relation Between Quality-Diversity Evaluation and Distribution-Fitting Goal in Text Generation

Assessing Dialogue Systems with Distribution Distances.

Differentiated Distribution Recovery for Neural Text Generation

MAUVE: Measuring the Gap Between Neural Text and Human Text using Divergence Frontiers

Standardizing the Measurement of Text Diversity: A Tool and a Comparative Analysis of Scores

Transferred Discrepancy: Quantifying the Difference Between Representations

Understanding Counterfactual Generation Using Maximum Mean Discrepancy.

Unlocking Structure Measuring: Introducing PDD, an Automatic Metric for Positional Discourse Coherence

Adding A Filter Based on The Discriminator to Improve Unconditional Text Generation

Diversity-Promoting GAN: A Cross-Entropy Based Generative Adversarial Network for Diversified Text Generation

UNION: An Unreferenced Metric for Evaluating Open-ended Story Generation

Discrete Diffusion Modeling by Estimating the Ratios of the Data Distribution

Attribute Based Interpretable Evaluation Metrics for Generative Models

DP-GAN: Diversity-Promoting Generative Adversarial Network for Generating Informative and Diversified Text

Rethinking and Refining the Distinct Metric

Exploring Distributional Discrepancy for Multidimensional Point Set Retrieval