Optimal subsampling algorithm for composite quantile regression with distributed data

Xiaohui Yuan,Shiting Zhou,Yue Wang
DOI: https://doi.org/10.48550/arXiv.2301.02448
2023-01-06
Abstract:For massive data stored at multiple machines, we propose a distributed subsampling procedure for the composite quantile regression. By establishing the consistency and asymptotic normality of the composite quantile regression estimator from a general subsampling algorithm, we derive the optimal subsampling probabilities and the optimal allocation sizes under the L-optimality criteria. A two-step algorithm to approximate the optimal subsampling procedure is developed. The proposed methods are illustrated through numerical experiments on simulated and real datasets.
Computation
What problem does this paper attempt to address?