Abstract:In this paper, we study learning-augmented algorithms for the Bahncard problem. The Bahncard problem is a generalization of the ski-rental problem, where a traveler needs to irrevocably and repeatedly decide between a cheap short-term solution and an expensive long-term one with an unknown future. Even though the problem is canonical, only a primal-dual-based learning-augmented algorithm was explicitly designed for it. We develop a new learning-augmented algorithm, named PFSUM, that incorporates both history and short-term future to improve online decision making. We derive the competitive ratio of PFSUM as a function of the prediction error and conduct extensive experiments to show that PFSUM outperforms the primal-dual-based algorithm.

What problem does this paper attempt to address?

This paper attempts to solve the online decision - making optimization in the Bahncard problem. Specifically, the Bahncard problem is a generalized version of the ski - rental problem, in which travelers need to repeatedly and irrevocably choose a cheap short - term solution and an expensive long - term solution under uncertain future situations. The key challenge of this problem is how to minimize the total cost (i.e., the purchase cost of Bahncard and train tickets) without knowing future travel requests. ### Main problem description Bahncard is a railway pass of the German railway company, which offers discounts on all train tickets within a fixed validity period. When there is a travel need, travelers can choose to buy tickets at the normal price or buy a Bahncard first and then enjoy the discount. The Bahncard problem is an online cost - minimization problem, and its goal is to minimize the total cost of Bahncard and ticket purchases without knowing future travel requests. ### Main contributions of the paper 1. **Introduction of the PFSUM algorithm**: The paper proposes a new learning - enhanced algorithm PFSUM, which combines historical information and short - term future predictions to improve online decision - making. 2. **Performance evaluation**: The author derives the competitive ratio of PFSUM and shows through a large number of experiments that PFSUM is superior to the existing learning - enhanced algorithms based on the primal - dual method. 3. **Theoretical analysis**: The paper analyzes in detail the competitive ratio of PFSUM and proves its performance under different prediction errors. ### Formal definition of the Bahncard problem The Bahncard problem is defined by three parameters, denoted as BP(C, β, T), where: - \(C\) is the cost of Bahncard; - \( \beta \) is the discount rate provided by Bahncard, that is, reducing any regular fare \(p\) to \( \beta p\), where \(0\leqslant\beta < 1\); - \(T\) is the validity period of Bahncard. When \( \beta = 0\) and \(T\rightarrow\infty\), the Bahncard problem degenerates into the classic ski - rental problem. ### Definition of the competitive ratio For a learning - enhanced online algorithm ALG, its competitive ratio \(CR_{ALG}(\eta)\) is a function of the prediction error \( \eta \). If \(CR_{ALG}(0)=\delta\), then ALG is said to be \( \delta \)-consistent; if for all \( \eta \), \(CR_{ALG}(\eta)\leqslant\vartheta\), then ALG is said to be \( \vartheta \)-robust. ### Working principle of the PFSUM algorithm When each travel request occurs and there is no valid Bahncard, the PFSUM algorithm will decide whether to buy a Bahncard according to the past time interval and future predictions. Specifically, PFSUM will buy a Bahncard when the following two conditions are simultaneously met: 1. The total fare in the past time interval \((t - T, t]\) is at least \( \gamma \), where \( \gamma:=\frac{C}{1 - \beta}\); 2. The total fare in the predicted future time interval \([t, t + T)\) is also at least \( \gamma \). ### Derivation of the competitive ratio Let the maximum prediction error be \( \eta \), then the competitive ratio \(CR_{PFSUM}(\eta)\) of PFSUM can be expressed as: \[ CR_{PFSUM}(\eta)= \begin{cases} \frac{2\gamma+(2 - \beta)\eta}{(1 + \beta)\gamma+\beta\eta}&\text{if }0\leqslant\eta\leqslant\gamma,\\ \frac{(3 - \beta)\gamma+\eta}{(1} \end{cases} \]

Learning-Augmented Algorithms for the Bahncard Problem

Learning-augmented Algorithms for Online Subset Sum

The Primal-Dual method for Learning Augmented Algorithms

Learning-augmented Online Algorithm for Two-level Ski-rental Problem

Provably Efficient UCB-type Algorithms For Learning Predictive State Representations

Learning-Augmented Algorithms for Online Concave Packing and Convex Covering Problems

Online Learning and Optimization for Revenue Management Problems with Add-on Discounts

On Optimal Consistency-Robustness Trade-Off for Learning-Augmented Multi-Option Ski Rental

Online Algorithms for Multi-shop Ski Rental with Machine Learned Advice

Online Uniform Allocation:Randomized Learning-Augmented Approximation Algorithms with Application to Digital Health

Learning-Augmented Algorithms for Online Linear and Semidefinite Programming

Nonparametric Multi-Product Dynamic Pricing with Demand Learning Via Simultaneous Price Perturbation

Learning-Augmented Algorithms with Explicit Predictors

Online Algorithms for Multi-shop Ski Rental with Machine Learned Predictions

A Simple Learning-Augmented Algorithm for Online Packing with Concave Objectives

A Bayesian Approach to Learning Bandit Structure in Markov Decision Processes

Primal-Dual Algorithms with Predictions for Online Bounded Allocation and Ad-Auctions Problems

Act as You Learn: Adaptive Decision-Making in Non-Stationary Markov Decision Processes

Optimal Algorithms for Ski Rental with Soft Machine-Learned Predictions

Learning-Augmented Scheduling

Learning in Budgeted Auctions with Spacing Objectives