DecoR: Deconfounding Time Series with Robust Regression

Felix Schur,Jonas Peters
2024-06-11
Abstract:Causal inference on time series data is a challenging problem, especially in the presence of unobserved confounders. This work focuses on estimating the causal effect between two time series, which are confounded by a third, unobserved time series. Assuming spectral sparsity of the confounder, we show how in the frequency domain this problem can be framed as an adversarial outlier problem. We introduce Deconfounding by Robust regression (DecoR), a novel approach that estimates the causal effect using robust linear regression in the frequency domain. Considering two different robust regression techniques, we first improve existing bounds on the estimation error for such techniques. Crucially, our results do not require distributional assumptions on the covariates. We can therefore use them in time series settings. Applying these results to DecoR, we prove, under suitable assumptions, upper bounds for the estimation error of DecoR that imply consistency. We show DecoR's effectiveness through experiments on synthetic data. Our experiments furthermore suggest that our method is robust with respect to model misspecification.
Machine Learning
What problem does this paper attempt to address?
The problem that this paper attempts to solve is to estimate the causal effect between two time series in the presence of unobserved confounding factors. Specifically, when a third unobserved time series acts as a confounding factor affecting the first two time series, how to accurately estimate the causal relationship between them. The paper assumes that the confounding factors are sparse in the frequency domain and proposes a new method - deconfounding by robust regression (DecoR), which uses robust linear regression in the frequency domain to estimate the causal effect. This method can not only handle non - independently and identically distributed data, but also can provide consistent estimation results under the assumption that the confounding factors are sparse. In addition, the paper also provides theoretical guarantees, proving that under appropriate assumptions, the estimation error of DecoR is bounded, and shows the effectiveness of this method on synthetic data.