Optimization Over Trained Neural Networks: Taking a Relaxing Walk

Jiatai Tong, Junyang Cai, Thiago Serra
2024-01-29
Abstract: Besides training, mathematical optimization is also used in deep learning to model and solve formulations over trained neural networks for purposes such as verification, compression, and optimization with learned constraints. However, solving these formulations soon becomes difficult as the network size grows due to the weak linear relaxation and dense constraint matrix. We have seen improvements in recent years with cutting plane algorithms, reformulations, and a heuristic based on Mixed-Integer Linear Programming (MILP). In this work, we propose a more scalable heuristic based on exploring global and local linear relaxations of the neural network model. Our heuristic is competitive with a state-of-the-art MILP solver and the prior heuristic while producing better solutions with increases in input, depth, and number of neurons.
Optimization and Control, Machine Learning
What problem does this paper attempt to address?
This paper addresses the difficulty of solving mathematical optimization problems formulated over trained neural networks, a difficulty that grows quickly with network size. Such formulations are used for verification, compression, and optimization with learned constraints. The authors attribute the difficulty to two properties of these formulations: a weak linear relaxation and a dense constraint matrix. Cutting-plane algorithms, reformulations, and a heuristic based on mixed-integer linear programming (MILP) have brought improvements in recent years, but scalability remains a concern. In response, the paper proposes a more scalable heuristic that explores global and local linear relaxations of the neural network model. According to the abstract, this heuristic is competitive with a state-of-the-art MILP solver and with the prior heuristic, and it produces better solutions as the input size, depth, and number of neurons increase.
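To make the "weak linear relaxation" concrete, here is a minimal sketch for a single ReLU neuron. It assumes the standard big-M MILP encoding of y = max(0, a) with pre-activation bounds lower < 0 < upper, which the summary above does not spell out and which may differ from the exact formulation used in the paper. Relaxing the binary variable to [0, 1] allows any y up to upper * (a - lower) / (upper - lower). The function names are hypothetical, and this illustrates only the relaxation gap, not the authors' heuristic.

```python
def relu(a: float) -> float:
    """Exact ReLU output for pre-activation a."""
    return max(0.0, a)


def relaxed_upper_bound(a: float, lower: float, upper: float) -> float:
    """Largest output y permitted by the LP relaxation of the big-M ReLU
    encoding, given pre-activation a and bounds lower < 0 < upper."""
    if not lower < 0.0 < upper:
        raise ValueError("this sketch assumes lower < 0 < upper")
    if not lower <= a <= upper:
        raise ValueError("a must lie within [lower, upper]")
    # Line through (lower, 0) and (upper, upper): the top edge of the
    # triangle that the relaxation traces out in the (a, y) plane.
    return upper * (a - lower) / (upper - lower)


def relaxation_gap(a: float, lower: float, upper: float) -> float:
    """How far the relaxation can overestimate the true ReLU output at a."""
    return relaxed_upper_bound(a, lower, upper) - relu(a)


if __name__ == "__main__":
    # Same pre-activation, progressively looser bounds.
    for bound in (1.0, 10.0, 100.0):
        print(bound, relaxation_gap(0.0, -bound, bound))
```

In this sketch the gap at a = 0 is upper * (-lower) / (upper - lower), so it scales with the width of the bounds. Bounds on pre-activations generally loosen as they are propagated through more layers, which is one way to see why the relaxation weakens as networks grow deeper and wider.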