Abstract:This paper addresses non-Gaussian regression with neural networks via the use of the Tukey g-and-h <a class="link-external link-http" href="http://distribution.The" rel="external noopener nofollow">this http URL</a> Tukey g-and-h transform is a flexible parametric transform with two parameters $g$ and $h$ which, when applied to a standard normal random variable, introduces both skewness and kurtosis, resulting in a distribution commonly called the Tukey g-and-h distribution. Specific values of $g$ and $h$ produce good approximations to other families of distributions, such as the Cauchy and student-t distributions. The flexibility of the Tukey g-and-h distribution has driven its popularity in the statistical community, in applied sciences and finance. In this work we consider the training of a neural network to predict the parameters of a Tukey g-and-h distribution in a regression framework via the minimization of the corresponding negative log-likelihood, despite the latter having no closed-form expression. We demonstrate the efficiency of our procedure in simulated examples and apply our method to a real-world dataset of global crop yield for several types of crops. Finally, we show how we can carry out a goodness-of-fit analysis between the predicted distributions and the test data. A Pytorch implementation is made available on Github and as a Pypi package.

What problem does this paper attempt to address?

The problem that this paper attempts to solve is: in the regression tasks of non - Gaussian distribution data, how to use neural networks for more accurate probability distribution prediction. Specifically, the author introduced the Tukey g - and - h distribution to handle data with skewness and kurtosis. These characteristics are common in many practical applications, but the traditional Gaussian assumption cannot capture them well. ### Specific description of the problem 1. **Limitations of traditional methods**: - In many practical regression problems, the conditional distribution of the target variable may not be a Gaussian distribution, but a non - Gaussian distribution with skewness and kurtosis. - The traditional Gaussian distribution assumption is too strict and cannot capture the complex patterns in the data, especially in the presence of heavy - tails or multi - modalities. 2. **Deficiencies of existing solutions**: - Using a single Gaussian distribution or a mixture of Gaussian distributions to model non - Gaussian data has certain limitations, especially when dealing with heavy - tailed distributions. - Non - parametric methods such as quantile regression are flexible, but are difficult to implement and have high computational costs in some cases. 3. **The method proposed in the paper**: - The author proposed using the Tukey g - and - h distribution to model non - Gaussian data. By predicting the parameters (g, h, μ, σ) of the Tukey g - and - h distribution through a neural network, the skewness and kurtosis of the data can be better captured. - This method allows predicting a complete conditional probability distribution for each input feature, rather than just a single - point prediction, thus providing more abundant uncertainty information. ### Core technologies of the solution - **Tukey g - and - h transformation**: By subjecting a standard normal random variable to a parameterized Tukey g - and - h transformation, distributions with different skewness and kurtosis can be generated. - **Negative log - likelihood loss function**: To train the neural network, the author used the negative log - likelihood as the loss function and solved the inverse problem of the Tukey g - and - h transformation by the bisection search method. - **Application scenarios**: This method was applied to the practical problem of global crop yield prediction, demonstrating its advantages in handling non - Gaussian data. ### Experimental verification - **Simulated data experiments**: Verified the robustness and accuracy of this method under different distributions, including when the target variable follows a Student's t - distribution. - **Real - data experiments**: Applied to global crop yield prediction, showing the superiority of Tukey g - and - h prediction in fitting effect compared with Gaussian prediction. In summary, this paper aims to solve the challenges in non - Gaussian data regression tasks by introducing the Tukey g - and - h distribution and the corresponding neural network training method, and provide more accurate and flexible probability distribution prediction.

Tukey g-and-h neural network regression for non-Gaussian data

Learning Gaussian Networks

Neural Networks: Further Insights into Error Function, Generalized Weights and Others

Theoretical considerations when simulating data from the g-and-h family of distributions

Asymmetric Heavy Tails and Implicit Bias in Gaussian Noise Injections

Neural Networks for Geospatial Data

H-GPR: A HYBRID STRATEGY FOR LARGE-SCALE GAUSSIAN PROCESS REGRESSION

Efficient Learning Algorithms for Gaussian Processes

Load forecasting using Twin Gaussian Process model

An Intuitive Tutorial to Gaussian Process Regression

Statistical Inference on High Dimensional Gaussian Graphical Regression Models

Tensorized Hypergraph Neural Networks

Monte Carlo Implementation of Gaussian Process Models for Bayesian Regression and Classification

Generalized Regression with Conditional GANs

gGN: Representing the Gene Ontology as low-rank Gaussian distributions

New Boosting Methods Of Gaussian Processes For Regression

Regularized Gauss-Newton for Optimizing Overparameterized Neural Networks

Student-t Process Regression with Dependent Student-t Noise.

Analyzing and Learning Sparse and Scale-Free Networks Using Gaussian Graphical Models.

Penalized gaussian process regression and classification for high-dimensional nonlinear data

Neural-g: A Deep Learning Framework for Mixing Density Estimation