Moco: A Learnable Meta Optimizer for Combinatorial Optimization

Tim Dernedde, Daniela Thyssens, Sören Dittrich, Maximilian Stubbemann, Lars Schmidt-Thieme
2024-02-08
Abstract:Relevant combinatorial optimization problems (COPs) are often NP-hard. While they have been tackled mainly via handcrafted heuristics in the past, advances in neural networks have motivated the development of general methods to learn heuristics from data. Many approaches utilize a neural network to directly construct a solution, but are limited in further improving based on already constructed solutions at inference time. Our approach, Moco, learns a graph neural network that updates the solution construction procedure based on features extracted from the current search state. This meta training procedure targets the overall best solution found during the search procedure given information such as the search budget. This allows Moco to adapt to varying circumstances such as different computational budgets. Moco is a fully learnable meta optimizer that does not utilize any problem specific local search or decomposition. We test Moco on the Traveling Salesman Problem (TSP) and Maximum Independent Set (MIS) and show that it outperforms other approaches on MIS and is overall competitive on the TSP, especially outperforming related approaches, partially even if they use additional local search.
Machine Learning
What problem does this paper attempt to address?
The problem that this paper attempts to solve is how to use neural networks to automatically learn effective heuristic algorithms in combinatorial optimization problems (COPs). Specifically, the paper proposes a method named MOCO (Meta Optimizer for Combinatorial Optimization), aiming to update the parameters in the solution construction process through graph neural networks (GNNs) so as to find better solutions during the search process. This method not only takes into account the characteristics of the current search state but also factors such as the remaining computational budget, enabling MOCO to adapt to different computational conditions and show competitiveness in different combinatorial optimization problems. ### Background of the Paper and Problem Description Combinatorial optimization problems are the basis of many practical applications, but they usually have NP - hard properties, which means that it is difficult to find exact solutions in polynomial time. Therefore, for a long time, the main method to solve such problems has been to use heuristics designed by hand. However, these heuristic algorithms are often customized for specific problems and lack generalization ability. In recent years, with the development of neural network technology, researchers have begun to explore methods of automatically learning heuristic algorithms from data to improve the universality and efficiency of algorithms. ### Core Contributions of MOCO 1. **Fully - learnable Meta - optimizer**: MOCO proposes a fully - learnable meta - optimizer that does not rely on any local search or decomposition strategy for specific problems. Through graph neural networks, MOCO can dynamically update the parameters in the solution construction process according to the characteristics of the current search state (such as the constructed solution, problem instance, etc.) and the remaining optimization budget. 2. **Adapt to Different Computational Budgets**: MOCO can adjust its strategy under different computational budgets to achieve the best solution. This makes MOCO not only perform well under the budget conditions used during training but also be able to adapt to larger budgets during inference, thereby finding better solutions. 3. **Effectiveness on Multiple Combinatorial Optimization Problems**: The paper tests the effect of MOCO on the Traveling Salesman Problem (TSP) and the Maximum Independent Set (MIS). The experimental results show that MOCO outperforms other methods on the MIS problem and also shows competitiveness on the TSP problem, especially without using additional local search. ### Method Overview The workflow of MOCO can be summarized as the following steps: 1. **Initialization**: Given a combinatorial optimization problem instance (represented as a graph), first use a graph neural network (GNN) to initialize the parameter vector \(\theta\). 2. **Solution Construction**: Sample a batch of solutions from \(\theta\). The construction process of each solution is a sequential decision - making process, where each step determines the next decision variable according to the value of \(\theta\). 3. **Feature Extraction**: Based on the constructed solutions and previous iterations, construct a set of features. 4. **Parameter Update**: Use another graph neural network (GNN) to update \(\theta\) according to the extracted features. This update process takes into account the current \(\theta\), its gradient, the problem instance, and the remaining optimization budget. 5. **Repeat the Above Steps**: The above process is repeated until the predetermined budget \(K\) is reached. ### Experimental Results The experimental results show that MOCO can effectively find high - quality solutions under different computational budgets. Especially on the TSP and MIS problems, the performance of MOCO is better than or at least comparable to existing methods. In addition, MOCO can adapt to larger budgets during inference, further improving the quality of solutions. In conclusion, by introducing a fully - learnable meta - optimizer, MOCO solves the problem of how to automatically learn effective heuristic algorithms in combinatorial optimization problems and shows its superiority on multiple benchmark problems.