In search of rogue waves: a novel proposal distribution for parallelized rejection sampling of the truncated KdV Gibbs measure

Nicholas J. Moore, Brendan Foerster
2024-11-26
Abstract: The Gibbs ensemble of the truncated KdV (TKdV) equation has been shown to accurately describe the anomalous wave statistics observed in laboratory experiments, in particular the emergence of extreme events. Here, we introduce a novel proposal distribution that facilitates efficient rejection sampling of the TKdV Gibbs measure. Within parameter regimes accessible to laboratory experiments and capable of producing extreme events, the proposal distribution generates 1 to 6 orders of magnitude more accepted samples than does a naive, uniform distribution. When equipped with the new proposal distribution, a simple rejection algorithm enjoys key advantages over a Markov chain Monte Carlo algorithm, including better parallelization properties and the generation of uncorrelated samples.
Numerical Analysis; Data Analysis, Statistics and Probability
What problem does this paper attempt to address?
The problem this paper addresses is how to sample the truncated KdV (TKdV) Gibbs measure efficiently, especially within the parameter range that can generate extreme events. Specifically, the authors introduce a new proposal distribution that makes rejection sampling efficient. The resulting method accepts far more samples than rejection sampling from a uniform proposal and, unlike Markov chain Monte Carlo (MCMC) methods, it parallelizes well and generates uncorrelated samples.

### Problem Background

Exceptionally large water waves (known as freak, extreme, or rogue waves) have attracted wide attention from the scientific community, marine practitioners, and the public in recent years. Research shows that abrupt changes in seabed topography can trigger anomalous wave activity. In particular, laboratory measurements show that the statistical properties of these waves can be explained by statistical and dynamical analysis of the variable-depth Korteweg-de Vries (KdV) equation. Studying these phenomena further requires sampling wave events efficiently from the Gibbs measure defined in that theoretical framework.

### Limitations of Existing Methods

1. **Uniform Distribution**: Using a uniform distribution as the proposal in the rejection algorithm leads to an extremely low acceptance rate.
2. **MCMC Method**: Although the MCMC algorithm is relatively efficient, it has some drawbacks:
   - Multiple adjustable parameters need to be tuned.
   - It requires a long "burn-in" time, which results in poor parallelization performance.
   - Adjacent samples are correlated.

### Advantages of the New Method

The new proposal distribution retains the easy parallelization and uncorrelated samples of simple rejection sampling while improving sampling efficiency.
By choosing a proposal distribution closer to the target distribution, the new method generates more accepted samples within the parameter range of interest, enabling more effective study of extreme events.

### Main Contributions

- **New Proposal Distribution**: Building on the work of Sun & Moore (2023), the authors construct a proposal distribution that meets two criteria: it approximates the target distribution, and it is easy to sample.
- **Efficient Rejection Sampling Algorithm**: A rejection sampling algorithm built on the new proposal distribution parallelizes well and generates uncorrelated samples.
- **Numerical Test Results**: Numerical tests verify the algorithm's performance and demonstrate its physical significance across different parameter ranges.

### Conclusion

The new proposal distribution and the corresponding rejection sampling algorithm provide an efficient method for studying extreme events in the truncated KdV equation, especially within the parameter range that can generate such events. The method improves both sampling efficiency and parallelization performance, making large-scale simulations possible.
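
To illustrate why a proposal close to the target matters, here is a minimal Python sketch of rejection sampling on a toy problem. It is not the paper's TKdV Gibbs measure or its proposal distribution. The target is a hypothetical density proportional to exp(-β Σ (x²/2 + x⁴/4)) on a box, and the "tailored" proposal is a Gaussian that matches only the quadratic part. The names `BETA`, `HALF_WIDTH`, `log_target`, `sample_uniform`, `sample_gaussian`, and `run_batches` are assumptions made for this example. Each batch draws from its own independent random stream, so batches could run on separate workers with no burn-in and no correlation between samples.

```python
import numpy as np

BETA = 50.0        # hypothetical inverse temperature of the toy target
HALF_WIDTH = 5.0   # toy target is supported on the box [-HALF_WIDTH, HALF_WIDTH]^d


def log_target(x):
    """Unnormalized log-density of the toy target; each row of x is one sample."""
    inside = np.all(np.abs(x) <= HALF_WIDTH, axis=1)
    energy = np.sum(0.5 * x**2 + 0.25 * x**4, axis=1)
    return np.where(inside, -BETA * energy, -np.inf)


def sample_uniform(rng, n, dim):
    """Rejection sampling with a uniform proposal on the box."""
    x = rng.uniform(-HALF_WIDTH, HALF_WIDTH, size=(n, dim))
    # The unnormalized target is at most 1, so it serves directly
    # as the acceptance probability.
    accept_prob = np.exp(log_target(x))
    return x[rng.uniform(size=n) < accept_prob]


def sample_gaussian(rng, n, dim):
    """Rejection sampling with a Gaussian proposal matching the quadratic part."""
    x = rng.normal(scale=BETA**-0.5, size=(n, dim))
    log_q = -0.5 * BETA * np.sum(x**2, axis=1)   # unnormalized Gaussian log-density
    # target / proposal = exp(-BETA * sum(x^4) / 4) inside the box, which is
    # bounded by 1, so the ratio is a valid acceptance probability.
    accept_prob = np.exp(log_target(x) - log_q)
    return x[rng.uniform(size=n) < accept_prob]


def run_batches(sampler, n_batches, n_per_batch, dim, seed=0):
    """Run independent batches; each could be dispatched to a separate worker."""
    streams = np.random.SeedSequence(seed).spawn(n_batches)
    batches = [sampler(np.random.default_rng(s), n_per_batch, dim) for s in streams]
    return np.concatenate(batches)


dim, n_batches, n_per_batch = 2, 4, 50_000
n_proposed = n_batches * n_per_batch

uniform_samples = run_batches(sample_uniform, n_batches, n_per_batch, dim)
gaussian_samples = run_batches(sample_gaussian, n_batches, n_per_batch, dim)

uniform_rate = len(uniform_samples) / n_proposed
gaussian_rate = len(gaussian_samples) / n_proposed
print(f"uniform proposal acceptance rate:  {uniform_rate:.5f}")
print(f"gaussian proposal acceptance rate: {gaussian_rate:.5f}")
```

In this toy setting the uniform proposal wastes most of its draws in regions where the target is negligible, and the waste compounds as `dim` grows, whereas the Gaussian proposal accepts most of its draws. The gap seen here is specific to this toy target and should not be read as a reproduction of the paper's reported figures.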