Learning optimal objective values for MILP

Lara Scavuzzo,Karen Aardal,Neil Yorke-Smith
2024-11-27
Abstract:Modern Mixed Integer Linear Programming (MILP) solvers use the Branch-and-Bound algorithm together with a plethora of auxiliary components that speed up the search. In recent years, there has been an explosive development in the use of machine learning for enhancing and supporting these algorithmic components. Within this line, we propose a methodology for predicting the optimal objective value, or, equivalently, predicting if the current incumbent is optimal. For this task, we introduce a predictor based on a graph neural network (GNN) architecture, together with a set of dynamic features. Experimental results on diverse benchmarks demonstrate the efficacy of our approach, achieving high accuracy in the prediction task and outperforming existing methods. These findings suggest new opportunities for integrating ML-driven predictions into MILP solvers, enabling smarter decision-making and improved performance.
Optimization and Control,Artificial Intelligence,Machine Learning,Mathematical Software
What problem does this paper attempt to address?
### What problems does this paper attempt to solve? This paper aims to solve two closely related problems in the process of solving mixed - integer linear programming (MILP): 1. **How to predict the optimal objective value?** - Problem description: Given an MILP instance, how to accurately predict its optimal objective value \( z^* \)? - Solution: The authors propose a method based on graph neural networks (GNN) to predict the optimal objective value. Specifically, they use the bipartite graph representation of the MILP instance and perform regression prediction through GNN. 2. **During the solving process, how to determine whether the current solution is the optimal solution?** - Problem description: In the Branch - and - Bound (B&B) solving process, how to predict whether the currently known feasible solution is the optimal solution? - Solution: The authors propose two classifiers for prediction: - One is a static classifier \( C_{\text{GNN}}^\epsilon \) based on the GNN regression model, which does not depend on the dynamic information in the B&B process. - The other is a classifier \( C_D \) based on the dynamic features collected during the solving process. These features include gap, tree weight, median gap, trend of open nodes, and ratio to GNN prediction. ### Research background Modern MILP solvers usually use the branch - and - bound algorithm and combine multiple auxiliary components to accelerate the search process. In recent years, machine learning techniques have been widely applied to enhance and support these algorithm components. The research focus of this paper is to use machine learning methods to predict the optimal objective value of MILP and determine whether the current solution is the optimal solution, thereby improving the performance of the solver. ### Method overview 1. **Optimal objective value prediction**: - Use GNN to process the bipartite graph representation of the MILP instance and output the prediction of the optimal objective value \( \tilde{z}^* \). - Test three different learning objectives: - \( \Theta_1 = z^* \) - \( \Theta_2=\frac{z^*}{z_{\text{LP}}} \) - \( \Theta_3 = z^* - z_{\text{LP}} \) - Where \( z_{\text{LP}} \) is the linear relaxation solution of the root node. 2. **Phase - transition prediction**: - Propose two classifiers: - \( C_{\text{GNN}}^\epsilon \): Based on the GNN regression model, use the parameter \( \epsilon \) to control the prediction confidence. - \( C_D \): A classifier based on dynamic features, using a logistic regression model. ### Experimental results - **Optimal objective value prediction**: - Tests were carried out on four benchmark datasets, and the results show that the objectives containing linear relaxation information (such as \( \Theta_2 \) and \( \Theta_3 \)) are more effective than directly predicting the optimal objective value (such as \( \Theta_1 \)). - The general model trained on the mixed dataset performs as well as the special - purpose model trained on specific - type instances, and even better in some cases. - **Phase - transition prediction**: - The performance of four predictors was compared, including the best - estimate and rank - 1 classifiers of Berthold et al., as well as the \( C_{\text{GNN}}^\epsilon \) and \( C_D \) classifiers proposed in this paper. - The results show that after adjusting the \( \epsilon \) parameter, the performance of \( C_{\text{GNN}}^\epsilon \) is better than other classifiers, and adding dynamic features further improves the prediction accuracy. ### Conclusion