Scalable Multilevel Monte Carlo Methods Exploiting Parallel Redistribution on Coarse Levels

Hillary R. Fairbanks,Delyan Z. Kalchev,Chak Shing Lee,Panayot S. Vassilevski
2024-08-05
Abstract:We study an element agglomeration coarsening strategy that requires data redistribution at coarse levels when the number of coarse elements becomes smaller than the used computational units (cores). The overall procedure generates coarse elements (general unstructured unions of fine grid elements) within the framework of element-based algebraic multigrid methods (or AMGe) studied previously. The AMGe generated coarse spaces have the ability to exhibit approximation properties of the same order as the fine-level ones since by construction they contain the piecewise polynomials of the same order as the fine level ones. These approximation properties are key for the successful use of AMGe in multilevel solvers for nonlinear partial differential equations as well as for multilevel Monte Carlo (MLMC) simulations. The ability to coarsen without being constrained by the number of available cores, as described in the present paper, allows to improve the scalability of these solvers as well as in the overall MLMC method. The paper illustrates this latter fact with detailed scalability study of MLMC simulations applied to model Darcy equations with a stochastic log-normal permeability field.
Numerical Analysis
What problem does this paper attempt to address?
The main problems that this paper attempts to solve are: in the multilevel Monte Carlo (MLMC) method, how to improve computational efficiency and scalability by introducing the Algebraic Multigrid method based on element aggregation (AMGe) and parallel data redistribution techniques. Specifically, the research aims to solve the following two key problems: 1. **Bottlenecks Encountered in the Coarsening Process in a Multicore Environment**: - When implementing the aggregation - based Algebraic Multigrid method in a distributed parallel environment, when the coarsening process reaches a certain level, it is necessary to aggregate the degrees of freedom (dofs) at the current level onto different cores. In particular, when the number of new coarse - level aggregates is less than the number of cores, the communication cost will exceed the benefits brought by reducing the local problem size. At this time, data redistribution is required to improve the scalability at coarser levels and allow for level - independent data distribution. 2. **Scalability and Precision Preservation in the MLMC Method**: - The multilevel Monte Carlo method relies on a hierarchy of coarse spaces with guaranteed approximation properties, which are preserved in the energy norm. Traditional AMG methods may not be able to meet this requirement in some cases, while the AMGe method can generate piecewise polynomials identical to the fine level, thus ensuring approximation properties. Therefore, by introducing AMGe and data redistribution techniques, the scalability of the MLMC method can be significantly improved without affecting the precision. ### Formula Summary - **Expectation Decomposition**: \[ E[Q_0]=E[Q_L]+\sum_{\ell = 0}^{L - 1}E[Q_\ell - Q_{\ell + 1}] \] where \(Q_\ell\) represents the target quantity at the \(\ell\)-th level. - **MLMC Estimation**: \[ \hat{Q}_{ML}^0:=\sum_{\ell = 0}^{L}\hat{Y}_{MC}^\ell \] where, \[ \hat{Y}_{MC}^\ell:=\begin{cases} \frac{1}{N_\ell}\sum_{i = 1}^{N_\ell}(Q_\ell^{(i)}-Q_{\ell + 1}^{(i)}),&\text{for }\ell = 0,\ldots,L - 1\\ \frac{1}{N_L}\sum_{i = 1}^{N_L}Q_L^{(i)},&\text{for }\ell = L \end{cases} \] - **Optimal Sample Number Formula**: \[ N_\ell = 2\varepsilon^{-2}\sqrt{\frac{\text{Var}(Y_\ell)}{C_\ell}}\left(\sum_{k = 0}^{L}\sqrt{C_k\text{Var}(Y_k)}\right) \] where \(C_\ell\) is the computation time for sampling at the \(\ell\)-th level, and \(\varepsilon\) is the desired mean - square error (MSE). Through these methods and techniques, the paper shows how to apply the Darcy equation model in MLMC simulations, especially in the case of a stochastic log - normal permeability field, and significantly improves computational efficiency and scalability.