Molecular topological deep learning for polymer property prediction

Cong Shen,Yipeng Zhang,Fei Han,Kelin Xia
2024-10-07
Abstract:Accurate and efficient prediction of polymer properties is of key importance for polymer design. Traditional experimental tools and density function theory (DFT)-based simulations for polymer property evaluation, are both expensive and time-consuming. Recently, a gigantic amount of graph-based molecular models have emerged and demonstrated huge potential in molecular data analysis. Even with the great progresses, these models tend to ignore the high-order and mutliscale information within the data. In this paper, we develop molecular topological deep learning (Mol-TDL) for polymer property analysis. Our Mol-TDL incorporates both high-order interactions and multiscale properties into topological deep learning architecture. The key idea is to represent polymer molecules as a series of simplicial complices at different scales and build up simplical neural networks accordingly. The aggregated information from different scales provides a more accurate prediction of polymer molecular properties.
Materials Science,Artificial Intelligence,Machine Learning
What problem does this paper attempt to address?
The problem this paper attempts to address is: how to efficiently and accurately predict the properties of polymers. Traditional methods such as experimental tools and density functional theory (DFT)-based simulations can provide accurate results, but they are costly and time-consuming. In recent years, graph-based molecular models have shown great potential in molecular data analysis, but these models often overlook higher-order and multi-scale information in the data. Therefore, this paper proposes a new method—Molecular Topological Deep Learning (Mol-TDL), which aims to more accurately predict the properties of polymer molecules by combining higher-order interactions and multi-scale characteristics. Specifically, the main contributions of the Mol-TDL method include: 1. **Multi-scale representation**: Representing polymer molecules through a series of simplicial complexes at different scales to capture higher-order and multi-scale interactions. 2. **Simplicial neural networks**: Constructing corresponding simplicial neural networks for each generated simplicial complex and aggregating information from different scales to improve prediction accuracy. 3. **Multi-scale topological contrastive learning**: Developing a multi-scale topological contrastive learning model for self-supervised pre-training to optimize message passing between simplicial complexes. Through these innovations, Mol-TDL achieves state-of-the-art performance on multiple benchmark datasets, particularly excelling in the prediction of electronic, optical, and thermodynamic properties.