Tukey g-and-h neural network regression for non-Gaussian data

Arthur P. Guillaumin, Natalia Efremova
2024-11-13
Abstract: This paper addresses non-Gaussian regression with neural networks via the use of the Tukey g-and-h distribution. The Tukey g-and-h transform is a flexible parametric transform with two parameters $g$ and $h$ which, when applied to a standard normal random variable, introduces both skewness and kurtosis, resulting in a distribution commonly called the Tukey g-and-h distribution. Specific values of $g$ and $h$ produce good approximations to other families of distributions, such as the Cauchy and Student's t-distributions. The flexibility of the Tukey g-and-h distribution has driven its popularity in the statistical community, in applied sciences and finance. In this work we consider the training of a neural network to predict the parameters of a Tukey g-and-h distribution in a regression framework via the minimization of the corresponding negative log-likelihood, despite the latter having no closed-form expression. We demonstrate the efficiency of our procedure in simulated examples and apply our method to a real-world dataset of global crop yield for several types of crops. Finally, we show how we can carry out a goodness-of-fit analysis between the predicted distributions and the test data. A PyTorch implementation is made available on GitHub and as a PyPI package.
Machine Learning
What problem does this paper attempt to address?
This paper asks how neural networks can predict more accurate probability distributions in regression tasks where the data are non-Gaussian. Specifically, the authors use the Tukey g-and-h distribution to handle data with skewness and kurtosis. These characteristics are common in practical applications, but the traditional Gaussian assumption does not capture them well.

### Specific description of the problem

1. **Limitations of traditional methods**:
   - In many practical regression problems, the conditional distribution of the target variable is not Gaussian but exhibits skewness and kurtosis.
   - The Gaussian assumption is too restrictive to capture complex patterns in the data, especially heavy tails or multimodality.
2. **Deficiencies of existing solutions**:
   - Modeling non-Gaussian data with a single Gaussian or a mixture of Gaussians has limitations, especially for heavy-tailed distributions.
   - Non-parametric methods such as quantile regression are flexible, but can be difficult to implement and computationally costly in some cases.
3. **The method proposed in the paper**:
   - The authors propose modeling non-Gaussian data with the Tukey g-and-h distribution. A neural network predicts the distribution's parameters (g, h, μ, σ), which lets the model capture the skewness and kurtosis of the data.
   - The method predicts a complete conditional probability distribution for each input, rather than a single point prediction, and therefore provides richer uncertainty information.
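To make the role of the four parameters concrete, here is a minimal sketch of the standard Tukey g-and-h transform, $\tau_{g,h}(z) = \frac{e^{gz} - 1}{g}\, e^{h z^2 / 2}$, applied to standard normal draws and then shifted by μ and scaled by σ. It is an illustration in plain Python, not the authors' PyTorch code: the function names `tukey_gh` and `sample_tukey_gh`, the small-g threshold, and the restriction to h ≥ 0 are assumptions made for this example.

```python
import math
import random


def tukey_gh(z, g, h):
    """Tukey g-and-h transform of a standard normal value z.

    Assumes h >= 0, so that the transform is increasing in z.
    """
    if abs(g) < 1e-12:
        # Limit of (exp(g*z) - 1) / g as g -> 0 (illustrative threshold).
        base = z
    else:
        base = math.expm1(g * z) / g
    return base * math.exp(0.5 * h * z * z)


def sample_tukey_gh(n, g, h, mu=0.0, sigma=1.0, seed=0):
    """Draw n samples of Y = mu + sigma * tukey_gh(Z, g, h), with Z ~ N(0, 1)."""
    rng = random.Random(seed)
    return [mu + sigma * tukey_gh(rng.gauss(0.0, 1.0), g, h) for _ in range(n)]


if __name__ == "__main__":
    # g > 0 skews the distribution to the right; h > 0 makes the tails heavier.
    samples = sample_tukey_gh(10_000, g=0.5, h=0.1)
    print(min(samples), max(samples))
```

With g = h = 0 the transform is the identity, so the Gaussian case is recovered as a special case.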
### Core technologies of the solution

- **Tukey g-and-h transformation**: Applying the parameterized Tukey g-and-h transformation to a standard normal random variable generates distributions with different skewness and kurtosis.
- **Negative log-likelihood loss function**: The neural network is trained with the negative log-likelihood as the loss function. Because the likelihood has no closed-form expression, the inverse of the Tukey g-and-h transformation is computed by bisection search.
- **Application scenarios**: The method is applied to global crop yield prediction, demonstrating its advantages on non-Gaussian data.

### Experimental verification

- **Simulated data experiments**: These verify the robustness and accuracy of the method under different distributions, including when the target variable follows a Student's t-distribution.
- **Real-data experiments**: Applied to global crop yield prediction, the Tukey g-and-h predictions fit the data better than Gaussian predictions.

In summary, this paper addresses the challenges of non-Gaussian regression by introducing the Tukey g-and-h distribution together with a corresponding neural network training method, providing more accurate and flexible probability distribution predictions.
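The negative log-likelihood with bisection-based inversion can be sketched as follows. For $Y = \mu + \sigma\,\tau_{g,h}(Z)$ with $Z$ standard normal, the change of variables gives $-\log f_Y(y) = \tfrac{1}{2} z^2 + \tfrac{1}{2}\log(2\pi) + \log\sigma + \log\tau'_{g,h}(z)$, where $z = \tau_{g,h}^{-1}((y - \mu)/\sigma)$ has to be found numerically. This is a scalar, plain-Python illustration rather than the authors' PyTorch implementation, and it does not address how gradients are propagated through the inverse. The function names, the bracket limit `max_abs_z`, the iteration count, and the assumptions h ≥ 0 and moderate |g| are all choices made for this example.

```python
import math


def tukey_gh(z, g, h):
    """Tukey g-and-h transform (assumes h >= 0, so it is increasing in z)."""
    base = z if abs(g) < 1e-12 else math.expm1(g * z) / g
    return base * math.exp(0.5 * h * z * z)


def tukey_gh_deriv(z, g, h):
    """Derivative of the transform with respect to z (positive when h >= 0)."""
    if abs(g) < 1e-12:
        inner = 1.0 + h * z * z
    else:
        inner = math.exp(g * z) + h * z * math.expm1(g * z) / g
    return inner * math.exp(0.5 * h * z * z)


def invert_tukey_gh(x, g, h, max_abs_z=32.0, iters=80):
    """Solve tukey_gh(z, g, h) = x for z by bisection.

    max_abs_z and iters are illustrative settings. A ValueError is raised
    when no bracket is found, e.g. when h = 0 and x lies outside the range
    of the transform.
    """
    lo, hi = -1.0, 1.0
    while tukey_gh(lo, g, h) > x:      # widen the bracket to the left
        lo *= 2.0
        if abs(lo) > max_abs_z:
            raise ValueError("x is below the bracketed range")
    while tukey_gh(hi, g, h) < x:      # widen the bracket to the right
        hi *= 2.0
        if abs(hi) > max_abs_z:
            raise ValueError("x is above the bracketed range")
    for _ in range(iters):             # halve the bracket each iteration
        mid = 0.5 * (lo + hi)
        if tukey_gh(mid, g, h) < x:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def tukey_gh_nll(y, mu, sigma, g, h):
    """Negative log-likelihood of one observation y (assumes sigma > 0)."""
    z = invert_tukey_gh((y - mu) / sigma, g, h)
    return (0.5 * z * z + 0.5 * math.log(2.0 * math.pi)
            + math.log(sigma) + math.log(tukey_gh_deriv(z, g, h)))


if __name__ == "__main__":
    print(tukey_gh_nll(1.3, mu=0.2, sigma=1.5, g=0.4, h=0.1))
```

Bisection is a natural fit here because the transform is monotone for h ≥ 0, so each halving step is guaranteed to keep the solution inside the bracket. With g = h = 0 the function reduces to the usual Gaussian negative log-likelihood.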