Crystalformer: Infinitely Connected Attention for Periodic Structure Encoding

Tatsunori Taniai,Ryo Igarashi,Yuta Suzuki,Naoya Chiba,Kotaro Saito,Yoshitaka Ushiku,Kanta Ono
2024-03-18
Abstract:Predicting physical properties of materials from their crystal structures is a fundamental problem in materials science. In peripheral areas such as the prediction of molecular properties, fully connected attention networks have been shown to be successful. However, unlike these finite atom arrangements, crystal structures are infinitely repeating, periodic arrangements of atoms, whose fully connected attention results in infinitely connected attention. In this work, we show that this infinitely connected attention can lead to a computationally tractable formulation, interpreted as neural potential summation, that performs infinite interatomic potential summations in a deeply learned feature space. We then propose a simple yet effective Transformer-based encoder architecture for crystal structures called Crystalformer. Compared to an existing Transformer-based model, the proposed model requires only 29.4% of the number of parameters, with minimal modifications to the original Transformer architecture. Despite the architectural simplicity, the proposed method outperforms state-of-the-art methods for various property regression tasks on the Materials Project and JARVIS-DFT datasets.
Machine Learning,Materials Science,Computational Physics
What problem does this paper attempt to address?
This paper primarily discusses how to effectively utilize the Transformer architecture to address the encoding problem of crystal structures. In materials science, predicting the physical properties from crystal structures is a fundamental challenge. Despite the successful application of fully connected attention networks for predicting molecular properties, the periodicity of crystal structures, with their infinitely repeated atomic arrangements, makes it complex to employ fully connected attention. The paper proposes a novel approach called Crystalformer, which interprets this infinite connection attention as neural potential summation, summing the potential energies between atoms in an abstract feature space. This method simplifies the design of the Transformer encoder for crystal structures, reducing the number of parameters by approximately 29.4% compared to existing Transformer models, while maintaining the simplicity of the architecture. Crystalformer solves the issue of infinite connection by formalizing the attention weights as potential energies that decay with distance, making them computable. Experimental results demonstrate that despite its simplicity, the proposed method outperforms existing state-of-the-art approaches in various attribute regression tasks on the Materials Project and JARVIS-DFT datasets. In summary, the paper attempts to address the periodicity of crystal structures to improve the accuracy of predicting material physical properties. By introducing the Crystalformer model, it achieves more efficient and accurate encoding of crystal structures.