Mixed Graphical Models for Causal Analysis of Multi-modal Variables

Andrew J Sedgewick, Joseph D. Ramsey, Peter Spirtes, Clark Glymour, Panayiotis V. Benos
DOI: https://doi.org/10.48550/arXiv.1704.02621
2017-04-09
Abstract: Graphical causal models are an important tool for knowledge discovery because they can represent both the causal relations between variables and the multivariate probability distributions over the data. Once learned, causal graphs can be used for classification, feature selection and hypothesis generation, while revealing the underlying causal network structure and thus allowing for arbitrary likelihood queries over the data. However, current algorithms for learning sparse directed graphs are generally designed to handle only one type of data (continuous-only or discrete-only), which limits their applicability to a large class of multi-modal biological datasets that include mixed-type variables. To address this issue, we developed new methods that modify and combine existing methods for finding undirected graphs with methods for finding directed graphs. These hybrid methods are not only faster, but also perform better than the directed graph estimation methods alone for a variety of parameter settings and data set sizes. Here, we describe a new conditional independence test for learning directed graphs over mixed data types and we compare performances of different graph learning strategies on synthetic data.
Artificial Intelligence, Machine Learning