Learning to solve graph metric dimension problem based on graph contrastive learning
Jian Wu, Li Wang, Weihua Yang, Haixia Zhao, Rui Wang, Jianji Cao, Fuhong Wei
DOI: https://doi.org/10.1007/s10489-023-05130-1
IF: 5.3
2023-01-01
Applied Intelligence
Abstract: Deep learning has been widely used to solve graph and combinatorial optimization problems. However, proper model deployment is critical both for training a model and for solving all problems. Existing frameworks mainly use reinforcement learning to learn to solve combinatorial optimization problems, treating a partial solution of the problem as the environment state and each vertex of the corresponding graph as an action. As a result, it is challenging to use sample data effectively in model training across different graphs. This study proposes a sampling-based, data-driven, distributed and independent graph learning framework that decouples graph structure learning from the problem-solving process, which to some extent facilitates industrial applications. Specifically, the framework consists of two independent parts: extracting the graph structure and learning to solve the problem. Under this framework, graph contrastive learning (GCL) is used to carry out the graph structure learning. Then, by aggregating state values over all nodes of a graph, a global reinforcement learning method is established to learn to solve the graph problem, combined with repair policies to improve performance. Experiments on synthetic graph datasets show that graph contrastive learning offers some advantages for training stability and for the accuracy of solving the graph problem, and that the repair policies are stable for solution search. However, the experiments also show that a graph neural network is not necessarily needed in the process of learning to solve the graph problem. Moreover, learning to solve the metric dimension problem (MDP) still faces challenges: learning performance decreases as the edge existence probability of the graphs increases, and it remains unknown what kind of reward function is appropriate for solving MDP.
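For readers unfamiliar with the problem itself, the sketch below illustrates the standard definition of metric dimension: a set of landmark vertices is resolving if every vertex of the graph has a distinct vector of shortest-path distances to the landmarks, and the metric dimension is the size of a smallest such set. This is not the paper's learning framework; it is only an exhaustive-search baseline for tiny graphs. All function names (`bfs_distances`, `is_resolving_set`, `metric_dimension_brute_force`, `random_graph`) are hypothetical, and `random_graph` assumes that the "edge existence probability" in the abstract refers to an Erdős-Rényi style generator, which the abstract does not state.

```python
from collections import deque
from itertools import combinations
import random


def bfs_distances(adj, source):
    """Hop distances from `source` to every vertex reachable from it."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        u = queue.popleft()
        for v in adj[u]:
            if v not in dist:
                dist[v] = dist[u] + 1
                queue.append(v)
    return dist


def is_resolving_set(adj, landmarks):
    """True if all vertices have pairwise distinct distance vectors."""
    dists = [bfs_distances(adj, s) for s in landmarks]
    seen = set()
    for v in adj:
        # An unreachable landmark yields None, treated as "infinite" distance.
        signature = tuple(d.get(v) for d in dists)
        if signature in seen:
            return False
        seen.add(signature)
    return True


def metric_dimension_brute_force(adj):
    """Return a smallest resolving set; exponential time, tiny graphs only."""
    vertices = list(adj)
    for k in range(1, len(vertices) + 1):
        for subset in combinations(vertices, k):
            if is_resolving_set(adj, subset):
                return subset
    return tuple(vertices)


def random_graph(n, p, seed=0):
    """Assumed synthetic generator: each edge exists independently with probability p."""
    rng = random.Random(seed)
    adj = {i: set() for i in range(n)}
    for i, j in combinations(range(n), 2):
        if rng.random() < p:
            adj[i].add(j)
            adj[j].add(i)
    return adj


if __name__ == "__main__":
    path = {0: [1], 1: [0, 2], 2: [1, 3], 3: [2]}
    print(metric_dimension_brute_force(path))
    print(metric_dimension_brute_force(random_graph(8, 0.5)))
```

The exhaustive search examines every vertex subset in order of size, so it is only practical for very small graphs; that cost is the motivation for learned or heuristic solvers such as the one the abstract describes.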