Abstract:Distributionally robust optimization (DRO) is a modeling framework in decision making under uncertainty in which the probability distribution of a random parameter is unknown although its partial information (e.g., statistical properties) is available. In this framework, the unknown probability distribution is assumed to lie in an ambiguity set consisting of all distributions that are compatible with the available partial information. Although DRO bridges the gap between stochastic programming and robust optimization, one of its limitations is that its models for large-scale problems can be significantly difficult to solve, especially when the uncertainty is of high dimension. In this paper, we propose computationally efficient inner and outer approximations for DRO problems under a piecewise linear objective function and with a moment-based ambiguity set and a combined ambiguity set including Wasserstein distance and moment information. In these approximations, we split a random vector into smaller pieces, leading to smaller matrix constraints. In addition, we use principal component analysis to shrink uncertainty space dimensionality. We quantify the quality of the developed approximations by deriving theoretical bounds on their optimality gap. We display the practical applicability of the proposed approximations in a production–transportation problem and a multiproduct newsvendor problem. The results demonstrate that these approximations dramatically reduce the computational time while maintaining high solution quality. The approximations also help construct an interval that is tight for most cases and includes the (unknown) optimal value for a large-scale DRO problem, which usually cannot be solved to optimality (or even feasibility in most cases). Summary of Contribution: This paper studies an important type of optimization problem, that is, distributionally robust optimization problems, by developing computationally efficient inner and outer approximations via operations research tools. Specifically, we consider several variants of such problems that are practically important and that admit tractable yet large-scale reformulation. We accordingly utilize random vector partition and principal component analysis to derive efficient approximations with smaller sizes, which, more importantly, provide a theoretical performance guarantee with respect to low optimality gaps. We verify the significant efficiency (i.e., reducing computational time while maintaining high solution quality) of our proposed approximations in solving both production–transportation and multiproduct newsvendor problems via extensive computing experiments.

Approximating Global Optimum for Probabilistic Truth Discovery

Approximate Truthful Mechanism Design For Two-Dimensional Orthogonal Knapsack Problem

Approximate Composable Truthful Mechanism Design

A Joint Maximum Likelihood Estimation Framework for Truth Discovery: A Unified Perspective

A Probabilistic Model for Truth Discovery with Object Correlations

Probabilistic model for truth discovery with mean and median check framework

A Survey on Truth Discovery

Approximating the Expected Values for Combinatorial Optimization Problems over Stochastic Points.

Approximating optimization problems in graphs with locational uncertainty

Convex Optimization for Linear Query Processing under Approximate Differential Privacy

Using Social Network Information in Bayesian Truth Discovery

Optimal False Discovery Rate Control for Large Scale Multiple Testing with Auxiliary Information

Truth Discovery in Data Streams: A Single-Pass Probabilistic Approach.

Hidden Markov Based Truth Discovery for Multi-Agent Labeling.

Fine-Tuned Convex Approximations of Probabilistic Reachable Sets under Data-driven Uncertainties

Utility Maximization under Uncertainty

Approximation and Heuristic Algorithms for Probabilistic Physical Search on General Graphs

Computationally Efficient Approximations for Distributionally Robust Optimization Under Moment and Wasserstein Ambiguity

Consensus Answers for Queries over Probabilistic Databases.

True mean value discovery over multiple data sources with unknown reliability degrees

Privacy-Preserving Truth Discovery in Crowd Sensing Systems