EnviroDetaNet: Pretrained E (3)-equivariant Message-Passing Neural Networks with Multi-Level Molecular Representations for Organic Molecule Spectra Prediction

Yuzhi Xu, Daqian Bian, Cheng-Wei Ju, Fanyu Zhao, Pujun Xie, Yuanqing Wang, Wei Hu, Zhenrong Sun, John Zhang, Tong Zhu
2024-09-06
Abstract:Fast and accurate spectral prediction plays a crucial role in molecular design within fields such as pharmaceutical and materials science. Nevertheless, predicting molecular spectra typically requires quantum chemistry calculations, posing significant challenges for fast predictions and high-throughput screening. In this paper, we propose an equivariant, fast, and robust model, named EnviroDetaNet, which integrates molecular environment information. EnviroDetaNet employs an E(3)-equivariant message-passing neural network combining intrinsic atomic properties, spatial features, and environmental information, allowing it to comprehensively capture both local and global molecular information. Compared to state-of-the-art models, EnviroDetaNet excels in various predictive tasks and maintains high accuracy even with a 50% reduction in training data, demonstrating strong generalization capabilities. Ablation studies confirm that molecular environment information is crucial for improving model stability and accuracy. EnviroDetaNet also shows outstanding performance in spectral predictions for complex molecular systems, making it a powerful tool for accelerating molecular discovery.
What problem does this paper attempt to address?