Disentanglement Learning via Topology

Nikita Balabin, Daria Voronkova, Ilya Trofimov, Evgeny Burnaev, Serguei Barannikov
2024-06-05
Abstract: We propose TopDis (Topological Disentanglement), a method for learning disentangled representations via adding a multi-scale topological loss term. Disentanglement is a crucial property of data representations substantial for the explainability and robustness of deep learning models and a step towards high-level cognition. The state-of-the-art methods are based on VAE and encourage the joint distribution of latent variables to be factorized. We take a different perspective on disentanglement by analyzing topological properties of data manifolds. In particular, we optimize the topological similarity for data manifolds traversals. To the best of our knowledge, our paper is the first one to propose a differentiable topological loss for disentanglement learning. Our experiments have shown that the proposed TopDis loss improves disentanglement scores such as MIG, FactorVAE score, SAP score, and DCI disentanglement score with respect to state-of-the-art results while preserving the reconstruction quality. Our method works in an unsupervised manner, permitting us to apply it to problems without labeled factors of variation. The TopDis loss works even when factors of variation are correlated. Additionally, we show how to use the proposed topological loss to find disentangled directions in a trained GAN.
Machine Learning
What problem does this paper attempt to address?
This paper addresses the problem of learning disentangled representations. The authors propose TopDis (Topological Disentanglement), which adds a multi-scale topological loss term to the variational autoencoder (VAE) loss. In a disentangled representation, each component corresponds to a single factor of variation. This property matters for the interpretability and robustness of deep learning models and is a step towards high-level cognition.

### Main Problems and Methods

1. **Problem Background**:
   - Disentangled representations are crucial for the success of machine learning models, especially in terms of interpretability and robustness.
   - Existing methods are mainly based on VAEs and encourage the joint distribution of latent variables to factorize into independent factors.
   - The authors take a different perspective and approach disentanglement through the topological properties of data manifolds.
2. **Method Overview**:
   - **TopDis Method**: A differentiable topological loss that optimizes the topological similarity of data manifold traversals.
   - **Technical Details**:
     - **Multi-scale Topological Loss**: Uses topological data analysis tools (such as RTD, Representation Topology Divergence) to measure the topological differences between two point clouds.
     - **Group(oid) Action**: Traverses the latent space with group(oid)-action shifts that leave the Gaussian distribution unchanged.
     - **Gradient Orthogonalization**: To balance reconstruction quality against topological regularization, the gradients of the topological loss (L_TD) are orthogonalized with respect to those of the reconstruction loss.

### Experimental Results

1. **Benchmark Tests**:
   - The authors ran experiments on multiple standard datasets: dSprites, 3D Shapes, 3D Faces, MPI 3D, and CelebA.
   - The results show that the TopDis loss significantly improves several disentanglement metrics (MIG, FactorVAE score, SAP score, and DCI disentanglement score) without degrading reconstruction quality.
2. **Quantitative Evaluation**:
   - Table 1 reports results on the different datasets and indicates that the TopDis loss significantly improves disentanglement for a range of models (β-VAE, FactorVAE, β-TCVAE, ControlVAE, and DAVA).
   - For example, on dSprites, combining β-VAE with TopDis raises the FactorVAE score from 0.807 to 0.833 and the MIG from 0.272 to 0.348.

### Conclusion

By introducing a multi-scale topological loss term, the authors improve the learning of disentangled representations. TopDis raises a range of disentanglement metrics without sacrificing reconstruction quality, and it offers a new perspective and tool for research on disentangled representations.
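The gradient orthogonalization step listed under Technical Details can be illustrated with a small sketch. It assumes the step is a standard vector projection: the component of the topological-loss gradient that lies along the reconstruction-loss gradient is subtracted, so the remaining update no longer pulls directly against reconstruction. The function names (`dot`, `orthogonalize`) and the flat-list representation of gradients are hypothetical choices for illustration, not taken from the paper's code.

```python
from typing import List, Sequence


def dot(a: Sequence[float], b: Sequence[float]) -> float:
    """Euclidean inner product of two equally sized vectors."""
    return sum(x * y for x, y in zip(a, b))


def orthogonalize(
    grad_top: Sequence[float],
    grad_rec: Sequence[float],
    eps: float = 1e-12,
) -> List[float]:
    """Remove from grad_top its component along grad_rec.

    Assumption: a plain projection is used; whether the paper applies it
    always or only when the two gradients conflict is not stated here.
    """
    denom = dot(grad_rec, grad_rec)
    if denom < eps:
        # Reconstruction gradient is (numerically) zero: nothing to project out.
        return list(grad_top)
    coef = dot(grad_top, grad_rec) / denom
    return [g - coef * r for g, r in zip(grad_top, grad_rec)]


# Toy gradients: the topological gradient partly opposes the reconstruction one.
g_rec = [1.0, 0.0]
g_top = [-1.0, 2.0]
g_ort = orthogonalize(g_top, g_rec)
print(g_ort, dot(g_ort, g_rec))
```

In a training loop, the projected vector would replace the raw topological gradient before the two gradients are summed into one parameter update.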
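The Group(oid) Action item says that latent shifts leave the Gaussian distribution unchanged, without spelling out the construction. One way such a shift could work, offered here purely as an assumption, is to move a latent coordinate by a constant in the cumulative-probability scale of its Gaussian and then map back with the inverse CDF. The shift is only defined when the shifted probability stays inside (0, 1), which is one reading of why the action is described as a group(oid) rather than a group. The function name `gaussian_shift` and its parameters are hypothetical.

```python
from statistics import NormalDist
from typing import Optional


def gaussian_shift(
    z: float, c: float, mean: float = 0.0, std: float = 1.0
) -> Optional[float]:
    """Shift z by c in CDF space of N(mean, std^2); None if undefined.

    Assumed construction: z -> F^{-1}(F(z) + c), where F is the Gaussian CDF.
    """
    dist = NormalDist(mu=mean, sigma=std)
    p = dist.cdf(z) + c
    if not 0.0 < p < 1.0:
        # The shift would leave the valid probability range.
        return None
    return dist.inv_cdf(p)


z0 = 0.3
z1 = gaussian_shift(z0, 0.2)      # move forward along the coordinate
back = gaussian_shift(z1, -0.2)   # the opposite shift undoes it
print(z1, back)
print(gaussian_shift(0.0, 0.6))   # undefined: 0.5 + 0.6 exceeds 1
```

Unlike adding a constant directly to `z`, equal steps in CDF space correspond to equal amounts of probability mass, so a traversal takes small steps near the mean and larger ones in the tails.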