Abstract:Identifying latent variables and causal structures from observational data is essential to many real-world applications involving biological data, medical data, and unstructured data such as images and languages. However, this task can be highly challenging, especially when observed variables are generated by causally related latent variables and the relationships are nonlinear. In this work, we investigate the identification problem for nonlinear latent hierarchical causal models in which observed variables are generated by a set of causally related latent variables, and some latent variables may not have observed children. We show that the identifiability of causal structures and latent variables (up to invertible transformations) can be achieved under mild assumptions: on causal structures, we allow for multiple paths between any pair of variables in the graph, which relaxes latent tree assumptions in prior work; on structural functions, we permit general nonlinearity and multi-dimensional continuous variables, alleviating existing work's parametric assumptions. Specifically, we first develop an identification criterion in the form of novel identifiability guarantees for an elementary latent variable model. Leveraging this criterion, we show that both causal structures and latent variables of the hierarchical model can be identified asymptotically by explicitly constructing an estimation procedure. To the best of our knowledge, our work is the first to establish identifiability guarantees for both causal structures and latent variables in nonlinear latent hierarchical models.

Learning Discrete Concepts in Latent Hierarchical Models

Learning Disjunctive Concepts Based on Fuzzy Semantic Cell Models Through Principles of Justifiable Granularity and Maximum Fuzzy Entropy

Towards Human-like Perception: Learning Structural Causal Model in Heterogeneous Graph

Differentiable Causal Discovery For Latent Hierarchical Causal Models

Learning Hierarchically Structured Concepts

Learning Hierarchical Concepts Based on Higher-Order Fuzzy Semantic Cell Models Through the Feed-Upward Mechanism and the Self-Organizing Strategy.

Learning Interpretable Concepts: Unifying Causal Representation Learning and Foundation Models

Identification of Nonlinear Latent Hierarchical Models

Learning Visual Hierarchies with Hyperbolic Embeddings

A Generalized Hierarchical Multi-Latent Space Model for Heterogeneous Learning

Learning Topic Hierarchies by Tree-Directed Latent Variable Models

Structural Causality-based Generalizable Concept Discovery Models

Discrete Latent Structure in Neural Networks

Learning Interpretable Concept-Based Models with Human Feedback

Learning Unseen Concepts Via Hierarchical Decomposition and Composition

Hierarchical Models: Intrinsic Separability in High Dimensions

Hierarchical Latent Concept Discovery for Video Event Detection

Learning causal structures using hidden compact representation

Latent Variable Modeling for Generative Concept Representations and Deep Generative Models

Probing the Latent Hierarchical Structure of Data via Diffusion Models

Learning Hierarchical Features from Generative Models