Explicit solutions for the asymptotically-optimal bandwidth in cross-validation

Karim M Abadir, Michel Lubrano
DOI: https://doi.org/10.1093/biomet/asae007
IF: 3.0279
2024-02-12
Biometrika
Abstract: We show that least squares cross-validation methods share a common structure which has an explicit asymptotic solution, when the chosen kernel is asymptotically separable in bandwidth and data. For density estimation with a multivariate Student t(ν) kernel, the cross-validation criterion becomes asymptotically equivalent to a polynomial of only three terms. Our bandwidth formulae are simple and noniterative, thus leading to very fast computations; their integrated squared error dominates that of traditional cross-validation implementations; they alleviate the notorious sample variability of cross-validation; and they overcome its breakdown in the case of repeated observations. We illustrate our method with univariate and bivariate applications, in density estimation and nonparametric regression, to a large dataset of Michigan State University academic wages and experience.
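For context, the following is a minimal sketch of the traditional, grid-search form of least squares cross-validation that the abstract treats as the baseline. It is not the paper's explicit bandwidth formula: it assumes a univariate Gaussian kernel rather than the Student t(ν) kernel, and the names `normal_pdf`, `lscv` and `lscv_bandwidth` are illustrative choices, not taken from the paper. The criterion evaluated is the integral of the squared density estimate minus twice the average leave-one-out density at the observations.

```python
import math
import random


def normal_pdf(u, s):
    """Density of N(0, s^2) evaluated at u."""
    return math.exp(-0.5 * (u / s) ** 2) / (s * math.sqrt(2.0 * math.pi))


def lscv(h, x):
    """Least squares cross-validation criterion for a Gaussian-kernel
    density estimate with bandwidth h (assumed kernel, for illustration)."""
    n = len(x)
    s2 = h * math.sqrt(2.0)  # two Gaussian kernels convolve to scale h*sqrt(2)
    sq_int = 0.0             # accumulates n^2 * integral of fhat^2
    loo = 0.0                # accumulates the off-diagonal kernel sum
    for i in range(n):
        for j in range(n):
            d = x[i] - x[j]
            sq_int += normal_pdf(d, s2)
            if i != j:
                loo += normal_pdf(d, h)
    return sq_int / n ** 2 - 2.0 * loo / (n * (n - 1))


def lscv_bandwidth(x, grid):
    """Iterative baseline: pick the grid point minimising the criterion."""
    return min(grid, key=lambda h: lscv(h, x))


# Simulated data, used only to exercise the functions.
random.seed(0)
sample = [random.gauss(0.0, 1.0) for _ in range(100)]
grid = [0.05 * k for k in range(1, 41)]  # candidate bandwidths 0.05 .. 2.0
h_cv = lscv_bandwidth(sample, grid)
print(h_cv)
```

The grid search costs one O(n²) pass per candidate bandwidth, which is the kind of iterative computation a noniterative formula avoids. The sketch also shows the repeated-observations issue in its simplest form: for the two tied points `[0.0, 0.0]`, `lscv(h, [0.0, 0.0])` is negative and decreases without bound as `h` shrinks, so the minimiser degenerates towards zero.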
statistics & probability, mathematical & computational biology, biology