Using VAEs to Learn Latent Variables: Observations on Applications in cryo-EM

Daniel G. Edelberg,Roy R. Lederman

2023-05-11

Abstract:Variational autoencoders (VAEs) are a popular generative model used to approximate distributions. The encoder part of the VAE is used in amortized learning of latent variables, producing a latent representation for data samples. Recently, VAEs have been used to characterize physical and biological systems. In this case study, we qualitatively examine the amortization properties of a VAE used in biological applications. We find that in this application the encoder bears a qualitative resemblance to more traditional explicit representation of latent variables.

Machine Learning,Quantitative Methods

What problem does this paper attempt to address?

This paper attempts to explore some characteristic issues of Variational Autoencoders (VAE) in the field of cryo-electron microscopy (cryo-EM). Specifically: 1. **Generalization Ability of the VAE Encoder**: - The paper focuses on the encoder part of the VAE and studies its generalization ability on cryo-EM data. Experiments show that in this case, the encoder does not generalize well to unseen data. 2. **Comparison with Traditional Explicit Variable Representation Methods**: - Researchers use a method called Variational Lookup Table (VLT) to explicitly represent the latent variables of each particle image instead of using the encoder in the VAE. The results show that the performance of VLT is similar to or even better than the original VAE method, indicating that the encoder is not the key factor for the success of the VAE. 3. **Relationship Between Input Data and Latent Variables**: - In the experiments, researchers replaced the input data with arbitrary images or even random noise but still used the original images for loss function calculation. The results show that even with random noise as input, the algorithm can still work relatively well, suggesting that the encoder may have overfitted to some extent to individual images. 4. **Invariance of the Encoder**: - The paper tests the generalization ability of the encoder to augmented data using known invariances in cryo-EM data (such as rotation and translation). The results indicate that the encoder does not maintain invariance well in this case, implying that it may also fail to generalize well to completely new test data. In summary, this paper aims to reveal some potential issues of VAE in certain scientific applications and emphasizes that the generalization ability of the encoder may not always be reliable, especially in the field of cryo-EM.

Using VAEs to Learn Latent Variables: Observations on Applications in cryo-EM

Variational autoencoders learn transferrable representations of metabolomics data

VAEs in the Presence of Missing Data

Hidden Talents of the Variational Autoencoder

The road from MLE to EM to VAE: A brief tutorial

How to train your VAE

ED-VAE: Entropy Decomposition of ELBO in Variational Autoencoders

Learning Energy-Based Model with Variational Auto-Encoder as Amortized Sampler

Physics and Chemistry from Parsimonious Representations: Image Analysis via Invariant Variational Autoencoders

Three Variations on Variational Autoencoders

Variational Encoder-Decoders for Learning Latent Representations of Physical Systems

Latent Space Diffusion Models of Cryo-EM Structures

AUTO-ENCODING VARIATIONAL BAYES

Disentangling Generative Factors of Physical Fields Using Variational Autoencoders

Modeling conditional distributions of neural and behavioral data with masked variational autoencoders

$Γ$-VAE: Curvature regularized variational autoencoders for uncovering emergent low dimensional geometric structure in high dimensional data

Variational autoencoders with latent high-dimensional steady geometric flows for dynamics

Tutorial on Variational Autoencoders

Variational AutoEncoder For Regression: Application to Brain Aging Analysis

Towards Model-Agnostic Posterior Approximation for Fast and Accurate Variational Autoencoders

From the Expectation Maximisation Algorithm to Autoencoded Variational Bayes