Stability and L2-penalty in Model Averaging

Hengkun Zhu,Guohua Zou

2023-11-23

Abstract:Model averaging has received much attention in the past two decades, which integrates available information by averaging over potential models. Although various model averaging methods have been developed, there are few literatures on the theoretical properties of model averaging from the perspective of stability, and the majority of these methods constrain model weights to a simplex. The aim of this paper is to introduce stability from statistical learning theory into model averaging. Thus, we define the stability, asymptotic empirical risk minimizer, generalization, and consistency of model averaging and study the relationship among them. Our results indicate that stability can ensure that model averaging has good generalization performance and consistency under reasonable conditions, where consistency means model averaging estimator can asymptotically minimize the mean squared prediction error. We also propose a L2-penalty model averaging method without limiting model weights and prove that it has stability and consistency. In order to reduce the impact of tuning parameter selection, we use 10-fold cross-validation to select a candidate set of tuning parameters and perform a weighted average of the estimators of model weights based on estimation errors. The Monte Carlo simulation and an illustrative application demonstrate the usefulness of the proposed method.

Machine Learning

What problem does this paper attempt to address?

The paper aims to address the following issues: 1. **Introducing the concept of stability**: The paper attempts to introduce the concept of "stability" from statistical learning theory into model averaging. Although there are various model averaging methods, theoretical research on the stability of model averaging is limited. The authors define the stability, asymptotic empirical risk minimization (AERM), generalization ability, and consistency of model averaging, and explore the relationships among them. The research results show that under reasonable conditions, stability can ensure good generalization performance and consistency of model averaging. 2. **Proposing an L2 penalty method for unconstrained weights**: For cases where model weights are not restricted, the paper proposes a new L2-penalty model averaging method, which does not constrain model weights and demonstrates its stability and consistency. To reduce the impact of parameter selection, the authors use 10-fold cross-validation to select a set of candidate parameters and perform weighted averaging based on estimation errors. Through the above two aspects of research, the paper aims to address the shortcomings of model averaging in stability theory and propose an improved method to enhance the generalization ability of model averaging.

Stability and L2-penalty in Model Averaging

Penalized Time-Varying Model Averaging

Model Averaging Estimation for Nonparametric Varying-Coefficient Models with Multiplicative Heteroscedasticity

On Asymptotic Optimality of Least Squares Model Averaging When True Model Is Included

A Scalable Frequentist Model Averaging Method

Parsimonious Model Averaging With a Diverging Number of Parameters

On High-Dimensional Asymptotic Properties of Model Averaging Estimators

Optimal Model Averaging Estimation for Generalized Linear Models and Generalized Linear Mixed-Effects Models

Jackknife Model Averaging for Additive Expectile Prediction

Post-averaging inference for optimal model averaging estimator in generalized linear models

Data-Driven Stochastic Averaging

Model averaging prediction by K -fold cross-validation

Partial Linear Model Averaging Prediction for Longitudinal Data

A General Framework For Frequentist Model Averaging

Model averaging for multivariate multiple regression models

Model Averaging for Generalized Linear Model with Covariates that are Missing completely at Random

A Model-Averaging Approach for High-Dimensional Regression

Model Averaging and Its Use in Economics

Jackknife Model Averaging for Mixed-Data Kernel-Weighted Spline Quantile Regressions

When and when not to use optimal model averaging

Model averaging: A shrinkage perspective