Optimal Subsampling Design for Polynomial Regression in one Covariate

Torsten Reuter,Rainer Schwabe
DOI: https://doi.org/10.48550/arXiv.2301.03295
2023-02-25
Abstract:Improvements in technology lead to increasing availability of large data sets which makes the need for data reduction and informative subsamples ever more important. In this paper we construct $ D $-optimal subsampling designs for polynomial regression in one covariate for invariant distributions of the covariate. We study quadratic regression more closely for specific distributions. In particular we make statements on the shape of the resulting optimal subsampling designs and the effect of the subsample size on the design. To illustrate the advantage of the optimal subsampling designs we examine the efficiency of uniform random subsampling.
Statistics Theory
What problem does this paper attempt to address?