Abstract: Based on the framework of Conformal Prediction (CP), we study the online construction of valid confidence sets given a black-box machine learning model. By converting the target confidence levels into quantile levels, the problem can be reduced to predicting the quantiles (in hindsight) of a sequentially revealed data sequence. Two very different approaches have been studied previously. (i) Direct approach: Assuming the data sequence is iid or exchangeable, one can maintain the empirical distribution of the observed data as an algorithmic belief and directly predict its quantiles. (ii) Indirect approach: As statistical assumptions often do not hold in practice, a recent trend is to consider the adversarial setting and apply first-order online optimization to moving quantile losses (Gibbs & Candès, 2021). This approach requires knowing the target quantile level beforehand, and the resulting confidence sets suffer from certain validity issues caused by the associated loss linearization. This paper presents a novel Bayesian CP framework that combines the strengths of both approaches. Without any statistical assumption, it is able to both: (i) answer multiple arbitrary confidence level queries online, with provably low regret; and (ii) overcome the validity issues suffered by first-order optimization baselines, because it is "data-centric" rather than "iterate-centric". From a technical perspective, our key idea is to regularize the algorithmic belief of the direct approach by a Bayesian prior, which "robustifies" it by simulating a non-linearized Follow the Regularized Leader (FTRL) algorithm on the output. For statisticians, this can be regarded as an online adversarial view of Bayesian inference. Importantly, the proposed belief update backbone is shared by prediction heads targeting different confidence levels, bringing practical benefits analogous to U-calibration (Kleinberg et al., 2023).
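To make the contrast in the abstract concrete, the following is a minimal Python sketch of the two prior approaches, plus a toy "prior-regularized" belief. It is an illustration under stated assumptions, not the paper's algorithm: the function names, the learning rate `lr`, the default threshold before any data arrives, and the representation of the prior as a list of pseudo-scores pooled with the observed scores are all hypothetical choices made here. The abstract does not specify the actual belief update or how the prior is weighted.

```python
import math


def empirical_quantile(scores, q):
    """Direct approach: q-quantile of the empirical distribution of past scores."""
    if not scores:
        return 0.0  # assumed default before any data arrives
    ordered = sorted(scores)
    # Smallest order statistic whose empirical CDF value is at least q.
    k = max(0, math.ceil(q * len(ordered)) - 1)
    return ordered[k]


def ogd_quantile_step(threshold, score, q, lr):
    """Indirect approach: one subgradient step on the quantile (pinball) loss.

    The level q must be fixed in advance; the update only sees whether the
    new score fell above or below the current threshold.
    """
    grad = -q if score > threshold else (1.0 - q)
    return threshold - lr * grad


def regularized_belief_quantile(prior_scores, observed_scores, q):
    """Toy prior-regularized belief (an assumption of this sketch).

    The prior is encoded as pseudo-scores pooled with the observed data, so
    one shared belief can answer queries at any level q.
    """
    return empirical_quantile(list(prior_scores) + list(observed_scores), q)


if __name__ == "__main__":
    prior = [0.0, 0.25, 0.5, 0.75, 1.0]  # hypothetical prior pseudo-scores
    observed = [0.9, 0.9, 0.9, 0.9, 0.9]
    # One belief, several confidence-level queries.
    for level in (0.5, 0.8, 0.95):
        print(level, regularized_belief_quantile(prior, observed, level))
```

In this sketch the direct and regularized predictors answer any quantile level from the same stored data, whereas `ogd_quantile_step` maintains a single threshold tied to one preselected `q`, which mirrors the limitation of the indirect approach noted in the abstract.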

Monty Hall and Optimized Conformal Prediction to Improve Decision-Making with LLMs

Conformal Prediction with Large Language Models for Multi-Choice Question Answering

Leveraging Large Language Models for Multiple Choice Question Answering

API Is Enough: Conformal Prediction for Large Language Models Without Logit-Access

ConU: Conformal Uncertainty in Large Language Models with Correctness Coverage Guarantees

Efficiently Deploying LLMs with Controlled Risk

Conformal Contextual Robust Optimization

Conformal Prediction Sets Improve Human Decision Making

Conformal Language Modeling

Uncertainty Quantification for Clinical Outcome Predictions with (Large) Language Models

Large language model validity via enhanced conformal prediction methods

Efficient Conformal Prediction via Cascaded Inference with Expanded Admission

Adaptation with Self-Evaluation to Improve Selective Prediction in LLMs

DeLLMa: Decision Making Under Uncertainty with Large Language Models

Conformal Prediction Regions for Time Series using Linear Complementarity Programming

Mitigating LLM Hallucinations via Conformal Abstention

Decision-Focused Uncertainty Quantification

Pareto Optimal Learning for Estimating Large Language Model Errors

A novel Deep Learning approach for one-step Conformal Prediction approximation

The Benefit of Being Bayesian in Online Conformal Prediction

Large Language Models Sensitivity to The Order of Options in Multiple-Choice Questions