Abstract:Self-supervised neural networks for combinatorial optimization (CO) handle non-differentiable constraints via relaxation. Despite their superiority in efficiency, one possible limitation is that these methods often put the constraints as soft penalty terms in the learning objective, and the degree of constraint-violation usually cannot be accurately or directly modulated. In this paper, we aim to develop a new paradigm to solve the CO problem by incorporating the constraints into the network architecture and computational operators, which is a more natural learning pipeline and decouples the constraint violation penalty from the raw objective optimization. Seeing such a paradigm may be rather general such that there only exist perturbation-based blackbox differentiable learning methods as generic solvers in literature, here we consider the commonly used cardinality constraints which in fact can incorporate many existing CO problem instances as its special cases. Specifically, the cardinality constraints are encoded by a differentiable optimal transport layer. We theoretically characterize the constraint-violations of two variants of our architecture (w.r.t. existing CO network whose constraint-violation is non-controlled), and we further show that their empirical performances are in line with our theoretical results. On self-supervised learning of pure CO problems on synthetic and real-world data, our networks surpass the state-of-the-art CO network, and are comparable to Gurobi and can sometimes even surpass. Our general paradigm also enables the application of end-to-end predictive portfolio optimization on real-world asset price data, improving the Sharpe ratio from 1.1 to 2.1 with a predict-then-optimize paradigm with LSTM+Gurobi.

Rethinking Supervised Learning Based Neural Combinatorial Optimization for Routing Problem

Data-efficient Supervised Learning is Powerful for Neural Combinatorial Optimization

Self-Improved Learning for Scalable Neural Combinatorial Optimization

How Good is Neural Combinatorial Optimization? A Systematic Evaluation on the Traveling Salesman Problem

Neural Combinatorial Optimization Algorithms for Solving Vehicle Routing Problems: A Comprehensive Survey with Perspectives

Online Vehicle Routing With Neural Combinatorial Optimization and Deep Reinforcement Learning

Learning to Search Feasible and Infeasible Regions of Routing Problems with Flexible Neural k-Opt

Instance-Conditioned Adaptation for Large-scale Generalization of Neural Combinatorial Optimization

A Reinforcement Learning Approach for Optimizing Multiple Traveling Salesman Problems over Graphs

A Hybrid Neural Combinatorial Optimization Framework Assisted by Automated Algorithm Design

Learning to Solve Combinatorial Optimization under Positive Linear Constraints via Non-Autoregressive Neural Networks

Neural Combinatorial Optimization with Reinforcement Learning

Prompt Learning for Generalized Vehicle Routing

Learning for Robust Combinatorial Optimization: Algorithm and Application

Neural Solver Selection for Combinatorial Optimization

Learning to Solve Combinatorial Optimization Problems on Real-World Graphs in Linear Time

Relaxed Combinatorial Optimization Networks with Self-Supervision: Theoretical and Empirical Notes on the Cardinality-Constrained Case

Discovering Lin-Kernighan-Helsgaun Heuristic for Routing Optimization Using Self-Supervised Reinforcement Learning

Multiobjective Combinatorial Optimization Using a Single Deep Reinforcement Learning Model

Neural Combinatorial Optimization with Heavy Decoder: Toward Large Scale Generalization

Attention, Learn to Solve Routing Problems!