Abstract:We introduce a conditional generative model for learning to disentangle the hidden factors of variation within a set of labeled observations, and separate them into complementary codes. One code summarizes the specified factors of variation associated with the labels. The other summarizes the remaining unspecified variability. During training, the only available source of supervision comes from our ability to distinguish among different observations belonging to the same class. Examples of such observations include images of a set of labeled objects captured at different viewpoints, or recordings of set of speakers dictating multiple phrases. In both instances, the intra-class diversity is the source of the unspecified factors of variation: each object is observed at multiple viewpoints, and each speaker dictates multiple phrases. Learning to disentangle the specified factors from the unspecified ones becomes easier when strong supervision is possible. Suppose that during training, we have access to pairs of images, where each pair shows two different objects captured from the same viewpoint. This source of alignment allows us to solve our task using existing methods. However, labels for the unspecified factors are usually unavailable in realistic scenarios where data acquisition is not strictly controlled. We address the problem of disentaglement in this more general setting by combining deep convolutional autoencoders with a form of adversarial training. Both factors of variation are implicitly captured in the organization of the learned embedding space, and can be used for solving single-image analogies. Experimental results on synthetic and real datasets show that the proposed method is capable of generalizing to unseen classes and intra-class variabilities.

Disentanglement via Mechanism Sparsity Regularization: A New Principle for Nonlinear ICA

Nonparametric Partial Disentanglement via Mechanism Sparsity: Sparse Actions, Interventions and Sparse Temporal Dependencies

Generalizing Nonlinear ICA Beyond Structural Sparsity

Disentangling Identifiable Features from Noisy Data with Structured Nonlinear ICA

Learning Causally Disentangled Representations via the Principle of Independent Causal Mechanisms

Disentanglement of Latent Representations via Causal Interventions

Nonlinear independent component analysis for principled disentanglement in unsupervised deep learning

On the Identifiability of Sparse ICA without Assuming Non-Gaussianity

Disentanglement via Latent Quantization

Towards efficient representation identification in supervised learning

Continual Learning of Nonlinear Independent Representations

Nonlinear ICA Using Volume-Preserving Transformations.

Efficient Probabilistic Latent Semantic Analysis with Sparsity Control

L1-Regularized ICA: A Novel Method for Analysis of Task-related fMRI Data

Disentangling Factors of Variation in Deep Representations Using Adversarial Training.

Identifiable Feature Learning for Spatial Data with Nonlinear ICA

A Sparsity Principle for Partially Observable Causal Representation Learning

ℓ 1 -Regularized ICA: A Novel Method for Analysis of Task-Related fMRI Data

Additive Decoders for Latent Variables Identification and Cartesian-Product Extrapolation

Representation Disentaglement via Regularization by Causal Identification

Disentangled Representation Learning for Causal Inference with Instruments