Score Function Gradient Estimation to Widen the Applicability of Decision-Focused Learning

Mattia Silvestri,Senne Berden,Jayanta Mandi,Ali İrfan Mahmutoğulları,Brandon Amos,Tias Guns,Michele Lombardi
2024-06-16
Abstract:Many real-world optimization problems contain parameters that are unknown before deployment time, either due to stochasticity or to lack of information (e.g., demand or travel times in delivery problems). A common strategy in such cases is to estimate said parameters via machine learning (ML) models trained to minimize the prediction error, which however is not necessarily aligned with the downstream task-level error. The decision-focused learning (DFL) paradigm overcomes this limitation by training to directly minimize a task loss, e.g. regret. Since the latter has non-informative gradients for combinatorial problems, state-of-the-art DFL methods introduce surrogates and approximations that enable training. But these methods exploit specific assumptions about the problem structures (e.g., convex or linear problems, unknown parameters only in the objective function). We propose an alternative method that makes no such assumptions, it combines stochastic smoothing with score function gradient estimation which works on any task loss. This opens up the use of DFL methods to nonlinear objectives, uncertain parameters in the problem constraints, and even two-stage stochastic optimization. Experiments show that it typically requires more epochs, but that it is on par with specialized methods and performs especially well for the difficult case of problems with uncertainty in the constraints, in terms of solution quality, scalability, or both.
Machine Learning,Artificial Intelligence
What problem does this paper attempt to address?
The paper primarily studies the Decision-Focused Learning (DFL) method in the context of the predict-then-optimize problem. Specifically, it addresses the issue of parameter uncertainty in real-world optimization problems, particularly in scenarios such as manufacturing or logistics where decisions need to be made based on uncertain demand or traffic conditions. The core contribution of the paper is the proposal of a new method to solve optimization problems where unknown parameters appear not only in the objective function but also in the constraints. Traditional decision-focused learning methods often assume that unknown parameters only appear in the objective function, and these methods do not sufficiently handle gradient information for combinatorial optimization problems. Therefore, the authors propose a method that combines random smoothing with Score Function Gradient Estimation (SFGE) to overcome these issues. The main advantage of this method is that it does not rely on the specific structure of the problem, such as whether it is a convex or linear problem, nor does it require the unknown parameters to appear only in the objective function. Through random smoothing, a smoother loss function can be obtained, which in turn provides useful gradient information for training. SFGE allows the computation of the gradient of this smoothed loss function with respect to the parameters, thereby achieving the goal of decision-focused learning. The experimental section validates the effectiveness and broad applicability of the proposed method. Particularly when unknown parameters appear in the constraints, the new method shows significant advantages in solution quality and scalability compared to existing decision-focused learning methods. Additionally, the paper discusses the potential application value of this method when applied to two-stage stochastic optimization problems. In summary, this paper aims to address the challenge in the predict-then-optimize problem of how to effectively use machine learning models to predict unknown parameters to minimize task loss, especially when these unknown parameters appear in the constraints of the optimization problem.