Comparison of Molecule Graph Representation with Similarity Consistency.

Bei Wang,Xiaoqing Lyu,Zhi Tang,Yifan Wang
DOI: https://doi.org/10.1109/bibm47256.2019.8982931
2019-01-01
Abstract:Many research tasks in the bioinformatics area, such as bioactivity prediction, de novo molecular design, and synthesis prediction, increasingly utilize graph-similarity-based analysis, which relies on selecting graph representation methods. However, most studies focus on the improvement of the final results of the tasks, and less attention has been paid on the differences of the distribution of the result of graph representation, (e.g. the embedded vectors) and how to evaluate them. In this paper, we propose a metric, mean degree of consistency (MDC), to evaluate the different approaches of graph representation with the help of retrieval results. To implement MDC, we introduce a serialized matching matrix and an optimization method based on partial sequence matching. We evaluate the efficiency and reliability of MDC with a series of experiments on graph-similarity-based retrievals for molecule graphs (MGs). Our result is helpful to the graph researchers for selecting graph representation methods.
What problem does this paper attempt to address?