The graft‐versus‐host problem for data‐driven gravity‐wave parameterizations in a one‐dimensional quasibiennial oscillation model

Ofer Shamir,David S. Connelly,Steven C. Hardiman,Zihan Shao,L. Minah Yang,Edwin P. Gerber
DOI: https://doi.org/10.1002/qj.4707
2024-03-28
Quarterly Journal of the Royal Meteorological Society
Abstract:Data‐driven models can emulate the gravity‐wave drags in a one‐dimensional quasibiennial oscillation model on which they were trained accurately, yielding the correct winds when coupled back to the one‐dimensional model. However, they are sensitive to perturbations in the gravity‐wave sources, for example, due to climate model biases or climate change. An effective solution for model biases is to remap the wave sources before feeding them to the data‐driven methods. However, the response to climate change is an ongoing challenge. Two key challenges in the development of data‐driven gravity‐wave parameterizations are generalization, how to ensure that a data‐driven scheme trained on the present‐day climate will continue to work in a new climate regime, and calibration, how to account for biases in the "host" climate model. Both problems depend fundamentally on the response to out‐of‐sample inputs compared with the training dataset, and are often conflicting. The ability to generalize to new climate regimes often goes hand in hand with sensitivity to model biases. To probe these challenges, we employ a one‐dimensional (1D) quasibiennial oscillation (QBO) model with a stochastic source term that represents convectively generated gravity waves in the Tropics with randomly varying strengths and spectra. We employ an array of machine‐learning models consisting of a fully connected feed‐forward neural network, a dilated convolutional neural network, an encoder–decoder, a boosted forest, and a support‐vector regression model. Our results demonstrate that data‐driven schemes trained on "observations" can be critically sensitive to model biases in the wave sources. While able to emulate accurately the stochastic source term on which they were trained, all of our schemes fail to simulate fully the expected QBO period or amplitude, even with the slightest perturbation to the wave sources. The main takeaway is that some measures will always be required to ensure the proper response to climate change and to account for model biases. We examine one approach based on the ideas of optimal transport, where the wave sources in the model are first remapped to the observed one before applying the data‐driven scheme. This approach is agnostic to the data‐driven method and guarantees that the model adheres to the observational constraints, making sure the model yields the right results for the right reasons.
meteorology & atmospheric sciences
What problem does this paper attempt to address?