An Introduction to Causal Discovery

Martin Huber
DOI: https://doi.org/10.48550/arXiv.2407.08602
2024-07-11
Abstract:In social sciences and economics, causal inference traditionally focuses on assessing the impact of predefined treatments (or interventions) on predefined outcomes, such as the effect of education programs on earnings. Causal discovery, in contrast, aims to uncover causal relationships among multiple variables in a data-driven manner, by investigating statistical associations rather than relying on predefined causal structures. This approach, more common in computer science, seeks to understand causality in an entire system of variables, which can be visualized by causal graphs. This survey provides an introduction to key concepts, algorithms, and applications of causal discovery from the perspectives of economics and social sciences. It covers fundamental concepts like d-separation, causal faithfulness, and Markov equivalence, sketches various algorithms for causal discovery, and discusses the back-door and front-door criteria for identifying causal effects. The survey concludes with more specific examples of causal discovery, e.g. for learning all variables that directly affect an outcome of interest and/or testing identification of causal effects in observational data.
Econometrics
What problem does this paper attempt to address?
The problem that this paper attempts to solve is, in social sciences and economics, how to discover causal relationships among multiple variables from a data - driven perspective, rather than relying on pre - defined causal structures to evaluate the impact of specific interventions on pre - defined outcomes. Specifically, the paper explores the method of Causal Discovery, which reveals causal relationships in variable systems by analyzing statistical associations, different from traditional causal inference methods. Traditional methods usually focus on evaluating the causal effects of specific treatment variables (such as educational programs, health treatments or marketing interventions) on the outcomes of interest (such as income, health or sales), while Causal Discovery aims to understand the causal associations within the entire variable system, and these associations can be represented by causal graphs. The main contributions of the paper lie in providing an introduction to the key concepts, algorithms and applications of Causal Discovery, especially from the perspectives of economics and social sciences. The paper discusses basic concepts such as d - separation, causal faithfulness and Markov equivalence, outlines multiple Causal Discovery algorithms, and discusses the back - door criterion and the front - door criterion for identifying the causal effects of pre - defined treatments. In addition, the paper also illustrates, through specific examples, how to use Causal Discovery to learn all variables that directly affect the outcomes of interest and to test the identification of causal effects in observational data.