Multi-scale Time-stepping of Partial Differential Equations with Transformers

AmirPouya Hemmasian,Amir Barati Farimani
2023-11-04
Abstract:Developing fast surrogates for Partial Differential Equations (PDEs) will accelerate design and optimization in almost all scientific and engineering applications. Neural networks have been receiving ever-increasing attention and demonstrated remarkable success in computational modeling of PDEs, however; their prediction accuracy is not at the level of full deployment. In this work, we utilize the transformer architecture, the backbone of numerous state-of-the-art AI models, to learn the dynamics of physical systems as the mixing of spatial patterns learned by a convolutional autoencoder. Moreover, we incorporate the idea of multi-scale hierarchical time-stepping to increase the prediction speed and decrease accumulated error over time. Our model achieves similar or better results in predicting the time-evolution of Navier-Stokes equations compared to the powerful Fourier Neural Operator (FNO) and two transformer-based neural operators OFormer and Galerkin Transformer.
Machine Learning
What problem does this paper attempt to address?
### Problems the Paper Attempts to Solve The paper aims to develop an efficient surrogate model based on the Transformer architecture to solve partial differential equations (PDEs). Specifically, the researchers utilize the Transformer architecture to learn the dynamics of physical systems and employ a convolutional autoencoder to learn the mixture of spatial patterns. Additionally, the paper introduces a multi-scale hierarchical time-stepping method to enhance prediction speed and reduce accumulated errors. The core objective of the paper is to achieve results comparable to or better than existing powerful models (such as the Fourier Neural Operator (FNO) and other Transformer-based neural operators) when solving time-dependent partial differential equations. Through these improvements, the model has achieved significant results in predicting the time evolution of the Navier-Stokes equations and has demonstrated fast computation during training and testing. However, for certain datasets (such as Kolmogorov Flow), the model failed to accurately capture the rapidly changing dynamic characteristics.