Molecular CT: Unifying Geometry and Representation Learning for Molecules at Different Scales

Jun Zhang,Yao-Kun Lei,Yaqiang Zhou,Yi Isaac Yang,Yi Qin Gao
2023-12-26
Abstract:Deep learning is changing many areas in molecular physics, and it has shown great potential to deliver new solutions to challenging molecular modeling problems. Along with this trend arises the increasing demand of expressive and versatile neural network architectures which are compatible with molecular systems. A new deep neural network architecture, Molecular Configuration Transformer (Molecular CT), is introduced for this purpose. Molecular CT is composed of a relation-aware encoder module and a computationally universal geometry learning unit, thus able to account for the relational constraints between particles meanwhile scalable to different particle numbers and invariant with respect to the trans-rotational transforms. The computational efficiency and universality make Molecular CT versatile for a variety of molecular learning scenarios and especially appealing for transferable representation learning across different molecular systems. As examples, we show that Molecular CT enables representational learning for molecular systems at different scales, and achieves comparable or improved results on common benchmarks using a more light-weighted structure compared to baseline models.
Machine Learning,Soft Condensed Matter
What problem does this paper attempt to address?
The main goal of this paper is to propose a new deep neural network architecture—Molecular Configuration Transformer (Molecular CT)—to address the problem of unified geometric and relational representation learning for molecular systems at different scales. Specifically, the paper attempts to solve the following core issues: 1. **Unified handling of atomic and molecular systems**: Traditionally, quantum mechanics (QM) models at the atomic level and molecular mechanics (MM) models that consider intermolecular interactions and structural constraints are treated separately. This paper introduces a dual-representation framework to unify the handling of these two systems. 2. **Designing a general and efficient neural network architecture**: To effectively handle the aforementioned types of molecular systems, the authors developed a new architecture called the Molecular Configuration Transformer. This architecture consists of a relation-aware encoder module and a computationally universal geometric learning unit, which can simultaneously consider relational constraints between particles and can be extended to different numbers of particles while maintaining translational and rotational invariance. 3. **Achieving cross-scale representation learning**: The model aims to enable representation learning across different molecular scales, meaning it can flexibly handle molecular systems of varying complexity, from single atoms to entire proteins. 4. **Improving model expressiveness and transferability**: The method introduced in the paper not only improves the model's performance in predicting molecular properties but also enhances the model's transfer learning capability across different molecular systems, which is crucial for model generalization in practical applications. In summary, the main contribution of this paper is the proposal of a new model that can effectively and uniformly handle various molecular systems from the atomic level to the molecular level, while also improving the model's representation learning ability and adaptability across different systems.