Abstract:Many real-world optimization problems contain parameters that are unknown before deployment time, either due to stochasticity or to lack of information (e.g., demand or travel times in delivery problems). A common strategy in such cases is to estimate said parameters via machine learning (ML) models trained to minimize the prediction error, which however is not necessarily aligned with the downstream task-level error. The decision-focused learning (DFL) paradigm overcomes this limitation by training to directly minimize a task loss, e.g. regret. Since the latter has non-informative gradients for combinatorial problems, state-of-the-art DFL methods introduce surrogates and approximations that enable training. But these methods exploit specific assumptions about the problem structures (e.g., convex or linear problems, unknown parameters only in the objective function). We propose an alternative method that makes no such assumptions, it combines stochastic smoothing with score function gradient estimation which works on any task loss. This opens up the use of DFL methods to nonlinear objectives, uncertain parameters in the problem constraints, and even two-stage stochastic optimization. Experiments show that it typically requires more epochs, but that it is on par with specialized methods and performs especially well for the difficult case of problems with uncertainty in the constraints, in terms of solution quality, scalability, or both.

What problem does this paper attempt to address?

The paper primarily studies the Decision-Focused Learning (DFL) method in the context of the predict-then-optimize problem. Specifically, it addresses the issue of parameter uncertainty in real-world optimization problems, particularly in scenarios such as manufacturing or logistics where decisions need to be made based on uncertain demand or traffic conditions. The core contribution of the paper is the proposal of a new method to solve optimization problems where unknown parameters appear not only in the objective function but also in the constraints. Traditional decision-focused learning methods often assume that unknown parameters only appear in the objective function, and these methods do not sufficiently handle gradient information for combinatorial optimization problems. Therefore, the authors propose a method that combines random smoothing with Score Function Gradient Estimation (SFGE) to overcome these issues. The main advantage of this method is that it does not rely on the specific structure of the problem, such as whether it is a convex or linear problem, nor does it require the unknown parameters to appear only in the objective function. Through random smoothing, a smoother loss function can be obtained, which in turn provides useful gradient information for training. SFGE allows the computation of the gradient of this smoothed loss function with respect to the parameters, thereby achieving the goal of decision-focused learning. The experimental section validates the effectiveness and broad applicability of the proposed method. Particularly when unknown parameters appear in the constraints, the new method shows significant advantages in solution quality and scalability compared to existing decision-focused learning methods. Additionally, the paper discusses the potential application value of this method when applied to two-stage stochastic optimization problems. In summary, this paper aims to address the challenge in the predict-then-optimize problem of how to effectively use machine learning models to predict unknown parameters to minimize task loss, especially when these unknown parameters appear in the constraints of the optimization problem.

Score Function Gradient Estimation to Widen the Applicability of Decision-Focused Learning

Decision-Focused Learning without Differentiable Optimization: Learning Locally Optimized Decision Losses

Decision-Focused Learning: Foundations, State of the Art, Benchmark and Future Opportunities

Locally Convex Global Loss Network for Decision-Focused Learning

Fighting Uncertainty with Gradients: Offline Reinforcement Learning via Diffusion Score Matching

DiCE: The Infinitely Differentiable Monte-Carlo Estimator

Melding the Data-Decisions Pipeline: Decision-Focused Learning for Combinatorial Optimization

Decision-Focused Learning to Predict Action Costs for Planning

Decision-Focused Forecasting: Decision Losses for Multistage Optimisation

Decision-Focused Learning with Directional Gradients

Robust Losses for Decision-Focused Learning

Differentiation of Multi-objective Data-driven Decision Pipeline

On the Robustness of Decision-Focused Learning

Decision-Oriented Learning with Differentiable Submodular Maximization for Vehicle Routing Problem

Optimal Learning for Sequential Decision Making for Expensive Cost Functions with Stochastic Binary Feedbacks

Deterministic Langevin Unconstrained Optimization with Normalizing Flows

Score-Based Methods for Discrete Optimization in Deep Learning

Deep learning as an alternative to global optimization in diffusion model for conflict tasks

Learning Linear Programs from Optimal Decisions

Efficient Learning of Generative Models via Finite-Difference Score Matching

A Note on Task-Aware Loss via Reweighing Prediction Loss by Decision-Regret