Training-Conditional Coverage Bounds under Covariate Shift

Mehrdad Pournaderi,Yu Xiang
2024-05-26
Abstract:Training-conditional coverage guarantees in conformal prediction concern the concentration of the error distribution, conditional on the training data, below some nominal level. The conformal prediction methodology has recently been generalized to the covariate shift setting, namely, the covariate distribution changes between the training and test data. In this paper, we study the training-conditional coverage properties of a range of conformal prediction methods under covariate shift via a weighted version of the Dvoretzky-Kiefer-Wolfowitz (DKW) inequality tailored for distribution change. The result for the split conformal method is almost assumption-free, while the results for the full conformal and jackknife+ methods rely on strong assumptions including the uniform stability of the training algorithm.
Machine Learning
What problem does this paper attempt to address?
The paper attempts to address the problem of how to provide coverage guarantees under training conditions for a range of conformal prediction methods in the presence of covariate shift. Specifically, the paper investigates the training condition coverage properties of different conformal prediction methods when the covariate distribution shifts from the training data to the test data. This is done through a weighted version of the Dvoretzky-Kiefer-Wolfowitz (DKW) inequality that adjusts for distribution changes. For split conformal methods, the results are almost assumption-free, whereas the results for full conformal and Jackknife+ methods rely on strong assumptions, including the uniform stability of the training algorithm. In simple terms, the core questions of the paper are: - How to ensure that the confidence intervals of the prediction model effectively cover the true response values in the scenario of covariate shift? - How to quantify and control the uncertainty of predictions in such cases? These questions are crucial for improving the reliability and interpretability of machine learning models in real-world applications, especially when the input data distribution is inconsistent with the distribution observed during training.