A Note on Distributed Quantile Regression by Pilot Sampling and One-Step Updating

Rui Pan,Tunan Ren,Baishan Guo,Feng Li,Guodong Li,Hansheng Wang
DOI: https://doi.org/10.1080/07350015.2021.1961789
2021-09-22
Journal of Business and Economic Statistics
Abstract:Quantile regression is a method of fundamental importance. How to efficiently conduct quantile regression for a large dataset on a distributed system is of great importance. We show that the popularly used one-shot estimation is statistically inefficient if data are not randomly distributed across different workers. To fix the problem, a novel one-step estimation method is developed with the following nice properties. First, the algorithm is communication efficient. That is the communication cost demanded is practically acceptable. Second, the resulting estimator is statistically efficient. That is its asymptotic covariance is the same as that of the global estimator. Third, the estimator is robust against data distribution. That is its consistency is guaranteed even if data are not randomly distributed across different workers. Numerical experiments are provided to corroborate our findings. A real example is also presented for illustration.
statistics & probability,social sciences, mathematical methods,economics
What problem does this paper attempt to address?