François PacaudMichel SchanenSungho ShinDaniel Adrian MaldonadoMihai Anitescua Centre Automatique et Systèmes,Mines Paris–PSL,Paris,Franceb Mathematics and Computer Science Department,Argonne National Laboratory,Lemont,IL,USAFrançois Pacaud is an assistant professor at Mines Paris-PSL,in the Centre Automatique et Systèmes (CAS). He obtained his M.Sc. in 2015 from Mines Paris-PSL,and in 2018 his Ph.D. in applied mathematics from the École des Ponts ParisTech,Paris,France. He was a postdoctoral fellow in the Compute Science Division at Argonne National Laboratory between 2020 and 2022. He his an expert in nonlinear optimization and numerical analysis for energy systems.Michel Schanen obtained is Ph.D. in automatic differentiation of MPI simulations from RWTH Aachen University,Germany. He joined Argonne as a Postdoctoral Appointee investigating large-scale adjoints in the fluid dynamics code Nek5000. In 2017,his research changed to optimization for power grid simulations as a computational engineer. His interests revolve around using the programming language Julia for modeling and running simulations efficiently on leadership computing systems.Sungho Shin received a B.S. degree in chemical engineering and mathematics from Seoul National University,Seoul,South Korea,in 2016. He received a Ph.D. degree in chemical engineering from the University of Wisconsin-Madison,Madison,WI,USA,in 2021. He is currently a postdoctoral researcher at the Mathematics and Computer Science Division of Argonne National Laboratory. His research interests include control theory and optimization algorithms for energy systems.Daniel Adrian Maldonado received the B.S. degree from the Universitat Politecnica de Valencia,Valencia,Spain,and the M.Eng. and Ph.D. degree from the Illinois Institute of Technology,Chicago. His research interests include transient stability simulation,state estimation,and optimal control.Mihai Anitescu is a senior computational mathematician in the Mathematics and Computer Science Division at Argonne National Laboratory and a professor in the Department of Statistics at the University of Chicago. He obtained his engineer diploma (electrical engineering) from the Polytechnic University of Bucharest in 1992 and his Ph.D. in applied mathematical and computational sciences from the University of Iowa in 1997. He specializes in the areas of numerical optimization,computational science,numerical analysis,and uncertainty quantification. He is on the editorial board of the SIAM Journal on Optimization,and he is a senior editor for Optimization Methods and Software. He is a past member of the editorial boards of Mathematical Programming A and B,SIAM Journal on Scientific Computing,and SIAM/ASA Journal in Uncertainty Quantification.

Abstract:We investigate how to port the standard interior-point method to new exascale architectures for block-structured nonlinear programs with state equations. Computationally, we decompose the interior-point algorithm into two successive operations: the evaluation of the derivatives and the solution of the associated Karush-Kuhn-Tucker (KKT) linear system. Our method accelerates both operations using two levels of parallelism. First, we distribute the computations on multiple processes using coarse parallelism. Second, each process uses SIMD/GPU accelerators locally to accelerate the operations using fine-grained parallelism. The KKT system is reduced by eliminating the inequalities and the state variables from the corresponding equations. We demonstrate our method's capability on the supercomputer Polaris, a testbed for the future exascale Aurora system. Each node is equipped with four GPUs, a setup amenable to our two-level approach. Our experiments on the stochastic optimal power flow problem show that the reduction method is 50x faster than the sparse linear solver HSL MA57 running in serial on the CPU, and 6x faster than Pardiso running in parallel on CPU on the same number of processes.

Towards a platform-portable linear algebra backend for OpenFOAM

Matrix-free approaches for GPU acceleration of a high-order finite element hydrodynamics application using MFEM, Umpire, and RAJA

Porting a sparse linear algebra math library to Intel GPUs

GPU-accelerated Linear Algebra for Coupled Solvers in Industrial CFD Applications with OpenFOAM

Porting Batched Iterative Solvers onto Intel GPUs with SYCL

GPU Acceleration of CFD Simulations in OpenFOAM

A graphics processing unit accelerated sparse direct solver and preconditioner with block low rank compression

A performance portable, fully implicit Landau collision operator with batched linear solvers

Evaluating performance portability of five shared-memory programming models using a high-order unstructured CFD solver

An Evaluation and Comparison of GPU Hardware and Solver Libraries for Accelerating the OPM Flow Reservoir Simulator

An adaptive finite element multigrid solver using GPU acceleration

Application of performance portability solutions for GPUs and many-core CPUs to track reconstruction kernels

Ginkgo: A Modern Linear Operator Algebra Framework for High Performance Computing

Hybrid programming-model strategies for GPU offloading of electronic structure calculation kernels

Parallel interior-point solver for block-structured nonlinear programs on SIMD/GPU architectures

Multi-GPU aggregation-based AMG preconditioner for iterative linear solvers

Generalized Gpu Acceleration For Applications Employing Finite-Volume Methods

Architecture Specific Generation of Large Scale Lattice Boltzmann Methods for Sparse Complex Geometries

A Study of Performance Portability in Plasma Physics Simulations

Accelerating an Iterative Eigensolver for Nuclear Structure Configuration Interaction Calculations on GPUs Using OpenACC

Portability and Scalability of OpenMP Offloading on State-of-the-art Accelerators