Abstract: Asymptotic optimality is a key theoretical property in model averaging. Due to technical difficulties, existing studies rely on restricted weight sets or the assumption that there is no true model with fixed dimensions in the candidate set. The focus of this paper is to overcome these difficulties. Surprisingly, we discover that when the penalty factor in the weight selection criterion diverges with a certain order and the true model dimension is fixed, asymptotic loss optimality does not hold, but asymptotic risk optimality does. This result differs from the corresponding result of Fang et al. (2023, Econometric Theory 39, 412-441) and reveals that using the discrete weight set of Hansen (2007, Econometrica 75, 1175-1189) can yield opposite asymptotic properties compared to using the usual weight set. Simulation studies illustrate the theoretical findings in a variety of settings.

What problem does this paper attempt to address?

This paper addresses the asymptotic optimality of least-squares model averaging when the candidate set contains the true model. Specifically, it focuses on overcoming the technical difficulties that arise in existing research in this setting. Because of these difficulties, existing studies either restrict the weight set or assume that the candidate set contains no true model of fixed dimension. The paper finds that when the penalty factor in the weight selection criterion diverges at a certain order and the dimension of the true model is fixed, asymptotic loss optimality does not hold, but asymptotic risk optimality does. This result differs from that of Fang et al. (2023) and reveals that using the discrete weight set of Hansen (2007) can yield opposite asymptotic properties compared with using the usual weight set.

### Background of the Paper

Least-squares model averaging has received extensive attention in econometrics and statistics, especially since Hansen (2007) proposed Mallows model averaging (MMA). Many alternative methods for selecting the averaging weights have since been proposed, including optimal mean-squared-error averaging, jackknife model averaging, heteroscedasticity-robust Cp model averaging, predictive model averaging, Kullback-Leibler model averaging, and parsimonious model averaging.

Asymptotic optimality is a key theoretical goal in model-averaging research. It means that, as the sample size increases, the model-averaging estimator asymptotically achieves the smallest possible prediction loss (or risk) among all model-averaging estimators. Most of the methods above have been proven to be asymptotically optimal under certain conditions.
However, these results usually require that all candidate models with a fixed dimension are approximating models, that is, that all such candidate models are misspecified.

### Research Questions

Another common scenario in model averaging is that the true model is among the candidate models. This scenario is also often encountered in model selection. Existing research in this setting mainly focuses on the asymptotic behavior of the selected weights and the asymptotic distribution of the model-averaging estimator, while theoretical results on asymptotic optimality are rare. Fang et al. (2023) made important contributions in this regard, but because of technical difficulties their results rely on a somewhat special weight set.

### Main Contributions

This paper re-examines the asymptotic optimality of least-squares model averaging in the nested-model setting when the true model is among the candidate models. Compared with Fang et al. (2023), its main contributions are twofold:

1. The results apply to general weight sets, not only to the restricted weight set used by Fang et al. (2023).
2. Asymptotic risk optimality is studied, a topic not covered by Fang et al. (2023).

### Main Findings

- When the penalty factor is \(\phi_n = \log n\) and the dimension of the true model is fixed, asymptotic loss optimality does not hold, which differs from the corresponding result of Fang et al. (2023).
- When \(\phi_n = \log n\) and the dimension of the true model does not diverge too fast, asymptotic risk optimality holds.
- When \(\phi_n = 2\) and the model dimension is fixed, MMA is neither asymptotically loss-optimal nor asymptotically risk-optimal, which is consistent with the corresponding results of Fang et al. (2023).

### Paper Structure

- Section 2 introduces a class of least-squares model-averaging methods that includes MMA and PMA as special cases.
- Section 3 reviews and discusses existing results.
- Section 4 presents the main theoretical results.
- Section 5 reports a finite-sample simulation study.
- Section 6 concludes.
- The proofs of the main results are provided in the appendix.

Through these results, the paper fills gaps in the existing literature and provides new insights into the asymptotic properties of model averaging when the true model is among the candidates.
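To make the role of the penalty factor \(\phi_n\) concrete, the following is a minimal sketch rather than the paper's implementation. It assumes the weight selection criterion has the Mallows-type form \(\|y-\hat\mu(w)\|^2 + \phi_n \hat\sigma^2 \sum_m w_m k_m\) over nested candidate models, with \(\phi_n = 2\) giving MMA; this form is not spelled out in the summary above. The helper names, the simulated data, the grid size `N`, and the variance estimate taken from the largest candidate model are all illustrative assumptions. The search runs over a discrete weight set of the kind used by Hansen (2007), chosen here only because it can be enumerated.

```python
import math
import random


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def nested_fits(X_cols, y):
    """Least-squares fits of y on the first 1, 2, ..., p columns (nested models)."""
    fits, basis = [], []
    current = [0.0] * len(y)
    for col in X_cols:
        q = list(col)
        for b in basis:  # modified Gram-Schmidt against earlier directions
            c = dot(b, q)
            q = [qi - c * bi for qi, bi in zip(q, b)]
        norm = math.sqrt(dot(q, q))
        q = [qi / norm for qi in q]
        basis.append(q)
        coef = dot(q, y)
        # Adding the new orthonormal direction gives the next nested projection.
        current = [f + coef * qi for f, qi in zip(current, q)]
        fits.append(current)
    return fits


def discrete_weights(M, N):
    """Yield weight vectors with entries in {0, 1/N, ..., 1} that sum to one."""
    def compositions(total, parts):
        if parts == 1:
            yield (total,)
            return
        for first in range(total + 1):
            for rest in compositions(total - first, parts - 1):
                yield (first,) + rest
    for comp in compositions(N, M):
        yield tuple(c / N for c in comp)


def criterion(w, fits, dims, y, sigma2, phi):
    """Assumed criterion: residual sum of squares plus phi * sigma2 * sum(w_m * k_m)."""
    avg = [sum(wm * f[i] for wm, f in zip(w, fits)) for i in range(len(y))]
    rss = sum((yi - ai) ** 2 for yi, ai in zip(y, avg))
    return rss + phi * sigma2 * sum(wm * k for wm, k in zip(w, dims))


def select_weights(fits, dims, y, phi, N=10):
    """Grid search for the criterion minimizer over the discrete weight set."""
    n = len(y)
    # Assumption: error variance estimated from the largest candidate model.
    sigma2 = sum((yi - fi) ** 2 for yi, fi in zip(y, fits[-1])) / (n - dims[-1])
    return min(discrete_weights(len(fits), N),
               key=lambda w: criterion(w, fits, dims, y, sigma2, phi))


# Simulated example: the true model uses the first two regressors and is
# one of four nested candidates.
rng = random.Random(0)
n = 200
X_cols = [[1.0] * n] + [[rng.gauss(0, 1) for _ in range(n)] for _ in range(3)]
y = [1.0 + 0.5 * X_cols[1][i] + rng.gauss(0, 1) for i in range(n)]
dims = [1, 2, 3, 4]
fits = nested_fits(X_cols, y)

w_mma = select_weights(fits, dims, y, phi=2.0)          # phi_n = 2
w_log = select_weights(fits, dims, y, phi=math.log(n))  # phi_n = log n
print("phi_n = 2     :", w_mma)
print("phi_n = log n :", w_log)
```

For a fixed sample, a larger penalty factor cannot increase the weighted average dimension \(\sum_m w_m k_m\) of the selected weights. The sketch only illustrates the mechanics of the criterion and does not by itself demonstrate any of the asymptotic results.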

On Asymptotic Optimality of Least Squares Model Averaging When True Model Is Included

Penalized Time-Varying Model Averaging

Parsimonious Model Averaging With a Diverging Number of Parameters

Model Averaging Estimation for Nonparametric Varying-Coefficient Models with Multiplicative Heteroscedasticity

Stability and L2-penalty in Model Averaging

A Scalable Frequentist Model Averaging Method

Optimal Model Averaging Estimation for Generalized Linear Models and Generalized Linear Mixed-Effects Models

On High-Dimensional Asymptotic Properties of Model Averaging Estimators

Model Averaging for Generalized Linear Model with Covariates that are Missing completely at Random

Model averaging for right censored data with measurement error

Jackknife Model Averaging for Additive Expectile Prediction

Optimal model averaging for partially linear models with missing response variables and error‐prone covariates

On optimality of Mallows model averaging

Post-averaging inference for optimal model averaging estimator in generalized linear models

Model Averaging for Accelerated Failure Time Models with Missing Censoring Indicators

Partial Linear Model Averaging Prediction for Longitudinal Data

When and when not to use optimal model averaging

Model Averaging for Estimating Treatment Effects With Binary Responses

Jackknife Model Averaging for Mixed-Data Kernel-Weighted Spline Quantile Regressions

Frequentist Model Averaging under Inequality Constraints

Model averaging prediction by K-fold cross-validation