Adaptive Conformal Inference by Betting

Aleksandr Podkopaev,Darren Xu,Kuang-Chih Lee
2024-12-27
Abstract:Conformal prediction is a valuable tool for quantifying predictive uncertainty of machine learning models. However, its applicability relies on the assumption of data exchangeability, a condition which is often not met in real-world scenarios. In this paper, we consider the problem of adaptive conformal inference without any assumptions about the data generating process. Existing approaches for adaptive conformal inference are based on optimizing the pinball loss using variants of online gradient descent. A notable shortcoming of such approaches is in their explicit dependence on and sensitivity to the choice of the learning rates. In this paper, we propose a different approach for adaptive conformal inference that leverages parameter-free online convex optimization techniques. We prove that our method controls long-term miscoverage frequency at a nominal level and demonstrate its convincing empirical performance without any need of performing cumbersome parameter tuning.
Machine Learning
What problem does this paper attempt to address?
### What problems does this paper attempt to solve? This paper aims to solve several key problems in online adaptive conformal inference, especially in the data stream scenario. Specifically: 1. **Limitations of the data exchangeability assumption**: - Existing conformal inference methods rely on the data exchangeability assumption, that is, all data points are independently and identically distributed (i.i.d.). However, in practical applications, especially when dealing with time - series data or sequentially arriving data, this assumption usually does not hold. For example, data may change over time, resulting in distribution shift. - The paper proposes a method that does not require any assumptions about the data - generation process to deal with such non - exchangeable data. 2. **The problem of learning rate selection**: - Traditional online gradient - descent methods need to pre - set the learning rate, and their performance is very sensitive to the choice of the learning rate. If the learning rate is too high, it may lead to overly conservative or unstable prediction intervals; if it is too low, it may lead to slow convergence and inability to adapt to changes in the data distribution in a timely manner. - The paper proposes a method based on non - parametric online convex optimization techniques, avoiding the cumbersome learning - rate tuning process. 3. **Balance between long - term coverage and cumulative regret**: - The paper not only focuses on long - term coverage but also considers other performance indicators such as cumulative regret to ensure the effectiveness and practicality of the method. - By using sub - linear regret bounds, the paper proves that the proposed method can reach the preset level in long - term coverage and performs well in practice. ### Method overview The paper proposes an adaptive conformal inference method based on "betting", using non - parametric online convex optimization techniques. The specific steps are as follows: - **Framework**: Consider the learning process as a game, in which the player places bets according to the results of consecutive coin flips. The coin - flip results are determined by the negative sub - gradient of the pinball loss function. - **Update rule**: Update the radius of the prediction interval through the Krichevsky - Trofimov (KT) estimator or the Online Newton Step (ONS) method. - **Theoretical guarantee**: Prove that the proposed method can reach the preset level in long - term coverage and has sub - linear regret bounds, thus ensuring the effectiveness and stability of the method. ### Experimental verification The paper verifies the effectiveness of the proposed method through a series of experiments, including: - **Simulated data sets**: Show the performance under different learning rates and prove that the proposed method can achieve near - optimal performance without parameter tuning. - **Real - world data sets**: For example, the power - demand - prediction data set, show the performance in multi - step prediction, further verifying the practicality and robustness of the method. In conclusion, this paper solves the limitations of existing conformal inference methods in non - exchangeable data and learning - rate selection, and provides a more flexible, stable, and easy - to - implement online adaptive conformal inference method.