Neural Improvement Heuristics for Graph Combinatorial Optimization Problems

Andoni I. Garmendia, Josu Ceberio, Alexander Mendiburu
DOI: https://doi.org/10.1109/TNNLS.2023.3314375
2023-10-07
Abstract: Recent advances in graph neural network architectures and increased computation power have revolutionized the field of combinatorial optimization (CO). Among the proposed models for CO problems, Neural Improvement (NI) models have been particularly successful. However, existing NI approaches are limited in their applicability to problems where crucial information is encoded in the edges, as they only consider node features and node-wise positional encodings. To overcome this limitation, we introduce a novel NI model capable of handling graph-based problems where information is encoded in the nodes, edges, or both. The presented model serves as a fundamental component for hill-climbing-based algorithms that guide the selection of neighborhood operations for each iteration. Conducted experiments demonstrate that the proposed model can recommend neighborhood operations that outperform conventional versions for the Preference Ranking Problem with a performance in the 99th percentile. We also extend the proposal to two well-known problems: the Traveling Salesman Problem and the Graph Partitioning Problem, recommending operations in the 98th and 97th percentile, respectively.
Artificial Intelligence, Discrete Mathematics, Machine Learning
What problem does this paper attempt to address?
### The Problem the Paper Attempts to Solve

This paper aims to address a key challenge in Combinatorial Optimization (CO): existing Neural Improvement (NI) methods typically consider only node features when dealing with graph-structured problems, neglecting edge information. This leads to suboptimal performance in problems where edge information is crucial, such as the Preference Ranking Problem (PRP) and the Graph Partitioning Problem (GPP).

Specifically, the paper proposes a new NI model capable of handling information stored on nodes, edges, or both in graph structures. This model can serve as a core component of hill-climbing-based algorithms to guide the selection of neighborhood operations at each step. Experimental validation shows that this model performs excellently on multiple combinatorial optimization problems, including PRP, the Traveling Salesman Problem (TSP), and GPP, with recommended operation performances reaching the 99th, 98th, and 97th percentiles, respectively.

### Background and Motivation

Combinatorial optimization problems are prevalent in various real-world domains such as logistics, manufacturing, and biology. Since most combinatorial optimization problems are NP-hard, finding the optimal solution using exact methods becomes impractical as the problem size increases. Therefore, heuristic and metaheuristic methods have become the primary means of solving these problems over the past few decades.

Although metaheuristic methods have become mainstream tools, they are evaluation-intensive algorithms that require evaluating thousands or even millions of candidate solutions, which is impractical in budget-constrained or online streaming optimization scenarios. In recent years, neural network (NN)-based methods have shown outstanding performance in various machine learning tasks, attracting attention from researchers in different fields. These methods can be divided into two categories: constructive methods and improvement methods.

### Limitations of Existing Methods

Existing NI methods mainly focus on node features, ignoring edge information. However, in certain problems such as PRP and GPP, critical information may be stored on edges or on both nodes and edges. Therefore, existing NI methods cannot fully utilize all available information, which limits their effectiveness on these problems.

### Main Contributions of the Paper

1. **Proposed a new NI model**: This model can handle information stored on nodes, edges, or both in graph structures, extending the application scope of existing NI methods.
2. **Demonstrated the model's versatility and effectiveness**: Experiments on multiple combinatorial optimization problems (PRP, TSP, and GPP) show the model's strong performance. It can be used alone or as part of classic local search techniques such as multi-start hill climbing, tabu search, or iterated local search.

### Method Overview

The paper formalizes the process of solving graph problems as a Markov Decision Process (MDP), where the policy $\pi$ selects an action $a_t$ at each step $t$ based on the given state $s_t$. The main components of the model include:

- **State**: Represents the information of the environment at step $t$, including static data (the instance of the problem to be solved) and dynamic data (the current solution $\omega_t$).
- **Action**: The action $a_t$ selected by the policy is a pair of items in the current solution. Once the pair is selected, an operator is applied to it to modify the current solution.
- **Reward**: The transition from state $s_t$ to $s_{t+1}$ results from applying an operator to a pair of items. The reward function represents the improvement in solution quality between the two states.

### Experimental Results

Experimental results show that the proposed NI model performs excellently on PRP, TSP, and GPP, with recommended operation performances reaching the 99th, 98th, and 97th percentiles, respectively.
This indicates that the model can effectively utilize both node and edge information in graph structures and has broad application potential in various combinatorial optimization problems.
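
The state, action, and reward loop from the Method Overview can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the toy objective (the sum of matrix entries above the diagonal after reordering, in the style of a linear ordering problem), the swap operator, and the names `objective`, `apply_swap`, `greedy_policy`, and `hill_climb`. In particular, `greedy_policy` is an exhaustive stand-in for the learned policy $\pi$; the point of an NI model is to propose the pair without evaluating every neighbor.

```python
import itertools


def objective(matrix, perm):
    """Toy objective (assumption): sum of entries above the diagonal
    once rows and columns are reordered according to `perm`."""
    n = len(perm)
    return sum(matrix[perm[i]][perm[j]] for i in range(n) for j in range(i + 1, n))


def apply_swap(perm, action):
    """Operator (assumption): swap the two positions named by the action."""
    i, j = action
    new_perm = list(perm)
    new_perm[i], new_perm[j] = new_perm[j], new_perm[i]
    return new_perm


def greedy_policy(matrix, perm):
    """Stand-in for the learned policy: scores every pair exhaustively and
    returns the best improving pair, or None if no swap improves."""
    base = objective(matrix, perm)
    best_action, best_gain = None, 0
    for action in itertools.combinations(range(len(perm)), 2):
        gain = objective(matrix, apply_swap(perm, action)) - base
        if gain > best_gain:
            best_action, best_gain = action, gain
    return best_action


def hill_climb(matrix, perm, policy, max_steps=100):
    """Hill climbing driven by a policy.
    State: (matrix, perm). Action: a pair of positions.
    Reward: improvement in the objective after applying the operator."""
    perm = list(perm)
    total_reward = 0
    for _ in range(max_steps):
        action = policy(matrix, perm)
        if action is None:
            break  # the policy proposes nothing: local optimum
        candidate = apply_swap(perm, action)
        reward = objective(matrix, candidate) - objective(matrix, perm)
        if reward <= 0:
            break  # hill climbing accepts only improving moves
        perm = candidate
        total_reward += reward
    return perm, total_reward


if __name__ == "__main__":
    # Edge-only instance: all information lives in the pairwise weights.
    matrix = [[0, 5, 1], [2, 0, 4], [6, 3, 0]]
    final_perm, total_reward = hill_climb(matrix, [0, 1, 2], greedy_policy)
    print(final_perm, total_reward, objective(matrix, final_perm))
```

Because `hill_climb` takes the policy as an argument, a trained model could in principle be dropped in place of `greedy_policy`, which mirrors the summary's description of the NI model as a component of hill-climbing-based algorithms.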