One component partial least squares, high dimensional regression, data splitting, and the multitude of models

David J. Olive,Lingling Zhang
DOI: https://doi.org/10.1080/03610926.2024.2303979
2024-01-24
Abstract:This article gives large sample theory for the one component partial least squares estimator, including some hypothesis tests for high dimensional data, under much weaker conditions than those in the literature. Simple theory is also given for some data splitting estimators and the marginal maximum likelihood estimators. It is shown that lasso, one component partial least squares, and ordinary least squares often estimate different population multiple linear regression models. The article also proves that there are often many valid population models for regression methods such as binary regression.
What problem does this paper attempt to address?