Abstract: In recent years, applying deep learning to solve physics problems has attracted much attention. Data-driven deep learning methods produce fast numerical operators that can learn approximate solutions to the whole system of partial differential equations (i.e., surrogate modeling). Although these neural networks may have lower accuracy than traditional numerical methods, once trained they are orders of magnitude faster at inference. Hence, one crucial feature is that these operators can generalize to unseen PDE parameters without expensive re-training. In this paper, we construct CFDBench, a benchmark tailored for evaluating the generalization ability of neural operators after training in computational fluid dynamics (CFD) problems. It features four classic CFD problems: lid-driven cavity flow, laminar boundary layer flow in circular tubes, dam flows through the steps, and periodic Karman vortex street. The data contains a total of 302K frames of velocity and pressure fields, involving 739 cases with different operating condition parameters, generated with numerical methods. We evaluate the effectiveness of popular neural operators, including feed-forward networks, DeepONet, FNO, and U-Net, on CFDBench by predicting flows with non-periodic boundary conditions, fluid properties, and flow domain shapes that are not seen during training. Appropriate modifications were made to apply popular deep neural networks to CFDBench and to accommodate more changing inputs. Empirical results on CFDBench show that many baseline models have errors as high as 300% in some problems, and suffer severe error accumulation when performing autoregressive inference. CFDBench facilitates a more comprehensive comparison between different neural operators for CFD than existing benchmarks.

Fluid: Dataset Abstraction and Elastic Acceleration for Cloud-native Deep Learning Training Jobs

High-Level Data Abstraction and Elastic Data Caching for Data-Intensive AI Applications on Cloud-Native Platforms

Liquid: Intelligent Resource Estimation and Network-Efficient Scheduling for Deep Learning Jobs on Distributed GPU Clusters

Dynamic Resource Allocation for Deep Learning Clusters with Separated Compute and Storage

ElasticFlow: an Elastic Serverless Training Platform for Distributed Deep Learning

DIESEL: A Dataset-Based Distributed Storage and Caching System for Large-Scale Deep Learning Training

Cloudless-Training: A Framework to Improve Efficiency of Geo-Distributed ML Training

SiloD: A Co-design of Caching and Scheduling for Deep Learning Clusters

FanStore: Enabling Efficient and Scalable I/O for Distributed Deep Learning

Efficient Device Scheduling with Multi-Job Federated Learning

FluidsNet: End-to-end Learning for Lagrangian Fluid Simulation

GreenFlow: A Carbon-Efficient Scheduler for Deep Learning Workloads

Energy-Efficient GPU Clusters Scheduling for Deep Learning

FfDL: A Flexible Multi-tenant Deep Learning Platform

Vapor: A GPU Sharing Scheduler with Communication and Computation Pipeline for Distributed Deep Learning

Bigflow: A General Optimization Layer for Distributed Computing Frameworks

CFDBench: A Large-Scale Benchmark for Machine Learning Methods in Fluid Dynamics

FluidPlaying: Efficient Adaptive Simulation for Highly Dynamic Fluid

FLUID: A Unified Evaluation Framework for Flexible Sequential Data

VELTAIR: towards high-performance multi-tenant deep learning services via adaptive compilation and scheduling

An Optimal Resource Allocator of Elastic Training for Deep Learning Jobs on Cloud