Abstract:One-shot non-autoregressive neural networks, different from RL-based ones, have been actively adopted for solving combinatorial optimization (CO) problems, which can be trained by the objective score in a self-supervised manner. Such methods have shown their superiority in efficiency (e.g. by parallelization) and potential for tackling predictive CO problems for decision-making under uncertainty. While the discrete constraints often become a bottleneck for gradient-based neural solvers, as currently handled in three typical ways: 1) adding a soft penalty in the objective, where a bounded violation of the constraints cannot be guaranteed, being critical to many constraint-sensitive scenarios; 2) perturbing the input to generate an approximate gradient in a black-box manner, though the constraints are exactly obeyed while the approximate gradients can hurt the performance on the objective score; 3) a compromise by developing soft algorithms whereby the output of neural networks obeys a relaxed constraint, and there can still occur an arbitrary degree of constraint-violation. Towards the ultimate goal of establishing a general framework for neural CO solver with the ability to control an arbitrary-small degree of constraint violation, in this paper, we focus on a more achievable and common setting: the cardinality constraints, which in fact can be readily encoded by a differentiable optimal transport (OT) layer. Based on this observation, we propose OT-based cardinality constraint encoding for end-to-end CO problem learning with two variants: Sinkhorn and Gumbel-Sinkhorn, whereby their violation of the constraints can be exactly characterized and bounded by our theoretical results. On synthetic and real-world CO problem instances, our methods surpass the state-of-the-art CO network and are comparable to (if not superior to) the commercial solver Gurobi. In particular, we further showcase a case study of applying our approach to the predictive portfolio optimization task on real-world asset price data, improving the Sharpe ratio from 1.1 to 2.0 of a strong LSTM+Gurobi baseline under the classic predict-then-optimize paradigm.

Relaxed Combinatorial Optimization Networks with Self-Supervision: Theoretical and Empirical Notes on the Cardinality-Constrained Case

Towards One-shot Neural Combinatorial Solvers: Theoretical and Empirical Notes on the Cardinality-Constrained Case

Learning to Solve Combinatorial Optimization under Positive Linear Constraints via Non-Autoregressive Neural Networks

Tackling Prevalent Conditions in Unsupervised Combinatorial Optimization: Cardinality, Minimum, Covering, and More

Self-Improved Learning for Scalable Neural Combinatorial Optimization

A unified pre-training and adaptation framework for combinatorial optimization on graphs

A discrete-time neural network for optimization problems with hybrid constraints.

ROCO: A General Framework for Evaluating Robustness of Combinatorial Optimization Solvers on Graphs

A Relaxed Optimization Approach for Cardinality-Constrained Portfolio Optimization

How Good is Neural Combinatorial Optimization? A Systematic Evaluation on the Traveling Salesman Problem

Instance-Conditioned Adaptation for Large-scale Generalization of Neural Combinatorial Optimization

A Bi-Level Framework for Learning to Solve Combinatorial Optimization on Graphs

Controlling Continuous Relaxation for Combinatorial Optimization

Learning for Robust Combinatorial Optimization: Algorithm and Application

Constrained Combinatorial Optimization with Reinforcement Learning

Constrained Bayesian Optimization with Adaptive Active Learning of Unknown Constraints

From Distribution Learning in Training to Gradient Search in Testing for Combinatorial Optimization

Continuous Tensor Relaxation for Finding Diverse Solutions in Combinatorial Optimization Problems

Neural Combinatorial Optimization with Heavy Decoder: Toward Large Scale Generalization

Multi-Objective Combinatorial Optimization Algorithm Based on Asynchronous Advantage Actor–Critic and Graph Transformer Networks