Automating the Selection of Proxy Variables of Unmeasured Confounders

Feng Xie,Zhengming Chen,Shanshan Luo,Wang Miao,Ruichu Cai,zhi geng
DOI: https://doi.org/10.48550/arxiv.2405.16130
2024-01-01
Abstract:Recently, interest has grown in the use of proxy variables of unobservedconfounding for inferring the causal effect in the presence of unmeasuredconfounders from observational data. One difficulty inhibiting the practicaluse is finding valid proxy variables of unobserved confounding to a targetcausal effect of interest. These proxy variables are typically justified bybackground knowledge. In this paper, we investigate the estimation of causaleffects among multiple treatments and a single outcome, all of which areaffected by unmeasured confounders, within a linear causal model, without priorknowledge of the validity of proxy variables. To be more specific, we firstextend the existing proxy variable estimator, originally addressing a singleunmeasured confounder, to accommodate scenarios where multiple unmeasuredconfounders exist between the treatments and the outcome. Subsequently, wepresent two different sets of precise identifiability conditions for selectingvalid proxy variables of unmeasured confounders, based on the second-orderstatistics and higher-order statistics of the data, respectively. Moreover, wepropose two data-driven methods for the selection of proxy variables and forthe unbiased estimation of causal effects. Theoretical analysis demonstratesthe correctness of our proposed algorithms. Experimental results on bothsynthetic and real-world data show the effectiveness of the proposed approach.
What problem does this paper attempt to address?