A theoretical study of Y structures for causal discovery

Subramani Mani,Peter L. Spirtes,Gregory F. Cooper
DOI: https://doi.org/10.48550/arXiv.1206.6853
2012-06-28
Abstract:There are several existing algorithms that under appropriate assumptions can reliably identify a subset of the underlying causal relationships from observational data. This paper introduces the first computationally feasible score-based algorithm that can reliably identify causal relationships in the large sample limit for discrete models, while allowing for the possibility that there are unobserved common causes. In doing so, the algorithm does not ever need to assign scores to causal structures with unobserved common causes. The algorithm is based on the identification of so called Y substructures within Bayesian network structures that can be learned from observational data. An example of a Y substructure is A -> C, B -> C, C -> D. After providing background on causal discovery, the paper proves the conditions under which the algorithm is reliable in the large sample limit.
Artificial Intelligence,Methodology
What problem does this paper attempt to address?