CFASL: Composite Factor-Aligned Symmetry Learning for Disentanglement in Variational AutoEncoder

Hee-Jun Jung, Jaehyoung Jeong, Kangil Kim
2024-01-19
Abstract: Symmetries of input and latent vectors have provided valuable insights for disentanglement learning in VAEs. However, only a few works have been proposed as unsupervised methods, and even these require known factor information in the training data. We propose a novel method, Composite Factor-Aligned Symmetry Learning (CFASL), which is integrated into VAEs for learning symmetry-based disentanglement in unsupervised learning without any knowledge of the dataset factor information. CFASL incorporates three novel features for learning symmetry-based disentanglement: 1) injecting inductive bias to align latent vector dimensions to factor-aligned symmetries within an explicit learnable symmetry codebook; 2) learning a composite symmetry to express unknown factor changes between two random samples by learning factor-aligned symmetries within the codebook; 3) inducing a group equivariant encoder and decoder in training VAEs with the two conditions. In addition, we propose an extended evaluation metric for multi-factor changes in comparison to disentanglement evaluation in VAEs. In quantitative and in-depth qualitative analysis, CFASL demonstrates a significant improvement of disentanglement in single-factor change and multi-factor change conditions compared to state-of-the-art methods.
Machine Learning, Artificial Intelligence
What problem does this paper attempt to address?
The paper addresses disentanglement in Variational Autoencoders (VAEs) in the unsupervised setting, where the factor information of the dataset is unknown. It proposes a method called "Composite Factor-Aligned Symmetry Learning" (CFASL), which learns symmetry-based disentanglement through three main features:

1. **Injecting Inductive Bias**: Aligning latent vector dimensions to factor-aligned symmetries through an explicit, learnable symmetry codebook.
2. **Learning Composite Symmetry**: Composing the factor-aligned symmetries in the codebook into a single composite symmetry that expresses the unknown factor changes between two random samples.
3. **Inducing Group Equivariant Encoders and Decoders**: Training the VAE so that its encoder and decoder become group equivariant under the two conditions above.

Additionally, the paper proposes an extended evaluation metric (m-FVM_k) to assess disentanglement when multiple factors change at once, and reports, through quantitative and qualitative analysis, that CFASL significantly improves over state-of-the-art methods under both single-factor and multi-factor change conditions.
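To make the idea of a composite symmetry built from a codebook more concrete, here is a minimal NumPy sketch. It is an illustration under stated assumptions, not the authors' implementation: the codebook entries are assumed to be fixed rotation generators (in CFASL they would be learned), the composite symmetry is assumed to be the matrix exponential of a weighted sum of codebook entries, and the names `rotation_generator`, `composite_symmetry`, and `expm_taylor` are hypothetical.

```python
import numpy as np


def expm_taylor(a, terms=30):
    """Matrix exponential via a truncated Taylor series (fine for small norms)."""
    result = np.eye(a.shape[0])
    term = np.eye(a.shape[0])
    for k in range(1, terms):
        term = term @ a / k          # a^k / k!
        result = result + term
    return result


def rotation_generator(dim, i, j):
    """Skew-symmetric generator of a rotation in the (i, j) latent plane.

    Assumption: each codebook entry acts on its own pair of latent
    dimensions, standing in for one factor-aligned symmetry.
    """
    g = np.zeros((dim, dim))
    g[i, j] = -1.0
    g[j, i] = 1.0
    return g


def composite_symmetry(codebook, weights):
    """Combine codebook entries into one group element.

    codebook: array of shape (n_symmetries, dim, dim)
    weights:  array of shape (n_symmetries,), how strongly each symmetry is used
    """
    algebra_element = np.tensordot(weights, codebook, axes=1)  # (dim, dim)
    return expm_taylor(algebra_element)


dim = 4
# Hypothetical two-entry codebook: one symmetry per latent plane.
codebook = np.stack([rotation_generator(dim, 0, 1),
                     rotation_generator(dim, 2, 3)])

# Only the first "factor" changes between the two samples.
weights = np.array([0.3, 0.0])
g = composite_symmetry(codebook, weights)

z1 = np.array([1.0, 0.0, 0.5, 0.5])   # latent vector of the first sample
z2_pred = g @ z1                      # predicted latent of the second sample
```

In this toy setup, a zero weight leaves the corresponding latent dimensions untouched, which is the kind of alignment between codebook entries and latent dimensions that the first feature aims for. In training, one would presumably compare `z2_pred` with the encoder's latent for the second sample; that loss is not shown here because the summary does not specify it.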