Evaluating Self-Supervised Learning for Molecular Graph Embeddings

Hanchen Wang,Jean Kaddour,Shengchao Liu,Jian Tang,Joan Lasenby,Qi Liu
2023-10-18
Abstract:Graph Self-Supervised Learning (GSSL) provides a robust pathway for acquiring embeddings without expert labelling, a capability that carries profound implications for molecular graphs due to the staggering number of potential molecules and the high cost of obtaining labels. However, GSSL methods are designed not for optimisation within a specific domain but rather for transferability across a variety of downstream tasks. This broad applicability complicates their evaluation. Addressing this challenge, we present "Molecular Graph Representation Evaluation" (MOLGRAPHEVAL), generating detailed profiles of molecular graph embeddings with interpretable and diversified attributes. MOLGRAPHEVAL offers a suite of probing tasks grouped into three categories: (i) generic graph, (ii) molecular substructure, and (iii) embedding space properties. By leveraging MOLGRAPHEVAL to benchmark existing GSSL methods against both current downstream datasets and our suite of tasks, we uncover significant inconsistencies between inferences drawn solely from existing datasets and those derived from more nuanced probing. These findings suggest that current evaluation methodologies fail to capture the entirety of the landscape.
Machine Learning,Quantitative Methods
What problem does this paper attempt to address?
The paper aims to address the evaluation issue of molecular graph embeddings in Graph Self-Supervised Learning (GSSL). Specifically, the paper focuses on the following aspects: 1. **Evaluation Complexity**: Current GSSL methods are designed for transferability across multiple downstream tasks rather than optimization for specific domains, which complicates the evaluation. 2. **Proposing a New Evaluation Framework**: The paper introduces "Molecular Graph Representation Evaluation" (MOLGRAPH EVAL), a framework that includes three types of probing tasks: general graph properties, molecular substructure properties, and embedding space properties, to generate a detailed analysis of molecular graph embedding characteristics. 3. **Revealing Inconsistencies**: When benchmarking existing GSSL methods using MOLGRAPH EVAL, significant discrepancies were found between conclusions drawn from existing datasets and the more detailed probing task results, indicating that current evaluation methods fail to comprehensively capture the actual performance of these embeddings. In summary, this paper attempts to improve the effectiveness evaluation of molecular graph self-supervised learning methods by introducing a comprehensive evaluation framework and reveals the shortcomings of current evaluation methods.