Gini in a Bottleneck: Sparse Molecular Representations for Graph Convolutional Neural Networks

Ryan Henderson,Djork-Arné Clevert,Floriane Montanari
DOI: https://doi.org/10.48550/arXiv.2010.04535
2020-11-24
Abstract:Due to the nature of deep learning approaches, it is inherently difficult to understand which aspects of a molecular graph drive the predictions of the network. As a mitigation strategy, we constrain certain weights in a multi-task graph convolutional neural network according to the Gini index to maximize the "inequality" of the learned representations. We show that this constraint does not degrade evaluation metrics for some targets, and allows us to combine the outputs of the graph convolutional operation in a visually interpretable way. We then perform a proof-of-concept experiment on quantum chemistry targets on the public QM9 dataset, and a larger experiment on ADMET targets on proprietary drug-like molecules. Since a benchmark of explainability in the latter case is difficult, we informally surveyed medicinal chemists within our organization to check for agreement between regions of the molecule they and the model identified as relevant to the properties in question.
Machine Learning
What problem does this paper attempt to address?
The problem that this paper attempts to solve is to improve the interpretability of graph convolutional neural networks (GCNs) when predicting molecular properties. Although multi - task graph convolutional neural networks perform well in predicting certain molecular attributes, such as drug screening, their black - box nature hinders their wider application in the chemistry community. To alleviate this problem, the authors introduce a regularization method based on the Gini coefficient. By constraining specific weights to maximize the "inequality" of the learned representations, the number of relevant node - level aggregations that need to be examined is reduced to explain the model's predictions. This method not only does not degrade the evaluation metrics of some targets but also allows the combination of the outputs of graph convolution operations in a visually interpretable way. The authors verified the effectiveness of this method through experiments on quantum - chemical targets in the public QM9 dataset and ADMET targets on proprietary drug - like molecules. In addition, the authors further verified the interpretive ability of the model through an internal survey of medicinal chemists' recognition of relevant regions in molecules.