Abstract: One-shot non-autoregressive neural networks, unlike RL-based ones, have been actively adopted for solving combinatorial optimization (CO) problems, and can be trained on the objective score in a self-supervised manner. Such methods have shown superior efficiency (e.g., through parallelization) and potential for tackling predictive CO problems for decision-making under uncertainty. However, discrete constraints often become a bottleneck for gradient-based neural solvers. They are currently handled in three typical ways: 1) adding a soft penalty to the objective, which cannot guarantee a bounded violation of the constraints, a guarantee that is critical in many constraint-sensitive scenarios; 2) perturbing the input to generate an approximate gradient in a black-box manner, where the constraints are obeyed exactly but the approximate gradients can hurt performance on the objective score; 3) a compromise that develops soft algorithms whereby the output of the neural network obeys a relaxed constraint, yet an arbitrary degree of constraint violation can still occur. Towards the ultimate goal of establishing a general framework for neural CO solvers that can keep constraint violation arbitrarily small, in this paper we focus on a more achievable and common setting: cardinality constraints, which can in fact be readily encoded by a differentiable optimal transport (OT) layer. Based on this observation, we propose OT-based cardinality constraint encoding for end-to-end CO problem learning with two variants, Sinkhorn and Gumbel-Sinkhorn, whose constraint violation is exactly characterized and bounded by our theoretical results. On synthetic and real-world CO problem instances, our methods surpass the state-of-the-art CO network and are comparable to (if not superior to) the commercial solver Gurobi. In particular, we further present a case study applying our approach to predictive portfolio optimization on real-world asset price data, raising the Sharpe ratio from 1.1, obtained by a strong LSTM+Gurobi baseline under the classic predict-then-optimize paradigm, to 2.0.
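
To make concrete the idea of encoding a cardinality constraint with an OT layer, here is a minimal NumPy sketch of one common construction. It is not taken from the paper. It assumes that the n scores are sources of unit mass, that the two targets are the minimum and maximum score with masses n-k and k, and that the cost is the normalized absolute distance. The function name `soft_topk_sinkhorn` and the parameters `tau` and `n_iters` are hypothetical. The abstract does not say where the Gumbel noise enters in the Gumbel-Sinkhorn variant, so that variant is left out.

```python
import numpy as np


def _logsumexp(a, axis):
    # Numerically stable log(sum(exp(a))) along one axis, keeping dimensions.
    m = a.max(axis=axis, keepdims=True)
    return m + np.log(np.exp(a - m).sum(axis=axis, keepdims=True))


def soft_topk_sinkhorn(scores, k, tau=0.05, n_iters=200):
    """Relaxed top-k indicator via entropic OT (illustrative sketch only).

    Assumed construction, not confirmed by the abstract: n sources with unit
    mass, two targets (min and max score) with masses n - k and k.
    """
    s = np.asarray(scores, dtype=float)
    n = s.shape[0]
    if not 0 < k < n:
        raise ValueError("k must satisfy 0 < k < n")

    # Cost of moving each score to the "not selected" / "selected" anchor.
    anchors = np.array([s.min(), s.max()])
    cost = np.abs(s[:, None] - anchors[None, :])      # shape (n, 2)
    cost = cost / (cost.max() + 1e-12)                # make tau scale-free

    log_t = -cost / tau                               # log of the Gibbs kernel
    log_col = np.log(np.array([n - k, k], dtype=float))

    for _ in range(n_iters):
        # Row step: every item carries total mass 1.
        log_t = log_t - _logsumexp(log_t, axis=1)
        # Column step: the two targets receive mass n - k and k.
        log_t = log_t - _logsumexp(log_t, axis=0) + log_col

    # Second column = soft membership in the selected set.
    return np.exp(log_t[:, 1])


if __name__ == "__main__":
    x = soft_topk_sinkhorn([0.1, 0.9, 0.3, 0.8, 0.2], k=2)
    print(np.round(x, 3))  # the entries for scores 0.9 and 0.8 are the largest
    print(x.sum())         # equals k up to floating-point error
```

Because the last step of every iteration rescales the columns, the returned vector sums to k by construction. A smaller `tau` pushes the entries toward 0 or 1, and in general needs more iterations to satisfy the row constraint as well.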

Data-efficient Supervised Learning is Powerful for Neural Combinatorial Optimization

Rethinking Supervised Learning Based Neural Combinatorial Optimization for Routing Problem

Self-Improved Learning for Scalable Neural Combinatorial Optimization

How Good is Neural Combinatorial Optimization? A Systematic Evaluation on the Traveling Salesman Problem

Neural Combinatorial Optimization with Heavy Decoder: Toward Large Scale Generalization

Learning to Solve Combinatorial Optimization under Positive Linear Constraints via Non-Autoregressive Neural Networks

Instance-Conditioned Adaptation for Large-scale Generalization of Neural Combinatorial Optimization

Learning for Robust Combinatorial Optimization: Algorithm and Application

A Hybrid Neural Combinatorial Optimization Framework Assisted by Automated Algorithm Design

Neural Solver Selection for Combinatorial Optimization

Relaxed Combinatorial Optimization Networks with Self-Supervision: Theoretical and Empirical Notes on the Cardinality-Constrained Case

A Differentiable Approach to Combinatorial Optimization using Dataless Neural Networks

Neural Combinatorial Optimization with Reinforcement Learning

Solving Optimization Problems Through Fully Convolutional Networks: an Application to the Traveling Salesman Problem

From Distribution Learning in Training to Gradient Search in Testing for Combinatorial Optimization

Efficient Meta Neural Heuristic for Multi-Objective Combinatorial Optimization

Towards One-shot Neural Combinatorial Solvers: Theoretical and Empirical Notes on the Cardinality-Constrained Case

A Reinforcement Learning Approach for Optimizing Multiple Traveling Salesman Problems over Graphs

ML4CO: Is GCNN All You Need? Graph Convolutional Neural Networks Produce Strong Baselines For Combinatorial Optimization Problems, If Tuned and Trained Properly, on Appropriate Data

Solving Large-Scale Multiobjective Optimization Problems With Sparse Optimal Solutions via Unsupervised Neural Networks

Deep Reinforcement Learning for Combinatorial Optimization: Covering Salesman Problems