Adversarial robust weighted Huber regression

Takeyuki Sasai, Hironori Fujisawa
2024-05-24
Abstract: We consider robust estimation of linear regression coefficients. In this note, we focus on the case where the covariates are sampled from an $L$-subGaussian distribution with unknown covariance, the noise is sampled from a distribution with a bounded absolute moment, and both covariates and noise may be contaminated by an adversary. We derive an estimation error bound that depends on the stable rank and the condition number of the covariance matrix of the covariates, for an estimator that can be computed in polynomial time.
Statistics Theory, Machine Learning
What problem does this paper attempt to address?
The paper addresses robust estimation of linear regression coefficients when both the covariates and the noise may be contaminated by an adversary. Specifically, it considers covariates drawn from an $L$-subGaussian distribution with an unknown covariance matrix, and noise drawn from a distribution with a bounded absolute moment. The authors propose an estimator that can be computed in polynomial time and derive an estimation error bound that depends on the stable rank and the condition number of the covariance matrix. The key contributions of the paper are:
1. Even when the covariance matrix is unknown, the proposed method achieves a more accurate error bound for covariates drawn from an $L$-subGaussian distribution.
2. A two-step weighted Huber regression algorithm is proposed: the covariates are first preprocessed to reduce the impact of outliers, and a weighted Huber regression is then fitted.
3. An upper bound on the estimation error is derived for this polynomial-time estimator, and its effectiveness is demonstrated.
Overall, the paper aims to make linear regression more robust to adversarial contamination and heavy-tailed noise.
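As a rough illustration of the two-step idea (preprocess the covariates to obtain sample weights, then fit a weighted Huber regression), here is a minimal NumPy sketch. It is not the authors' algorithm. The preprocessing step is a hypothetical stand-in that gives zero weight to the rows with the largest Euclidean norm, and the Huber fit uses plain iteratively reweighted least squares. The function names `trimming_weights` and `weighted_huber`, the threshold `delta`, the contamination fraction `eps`, and the simulated data are all assumptions made for this example.

```python
import numpy as np


def trimming_weights(X, eps):
    """Hypothetical preprocessing step (not the paper's procedure):
    give weight 0 to the ceil(eps * n) rows of X with the largest
    Euclidean norm and weight 1 to all other rows."""
    n = X.shape[0]
    k = int(np.ceil(eps * n))
    w = np.ones(n)
    if k > 0:
        norms = np.linalg.norm(X, axis=1)
        w[np.argsort(norms)[-k:]] = 0.0
    return w


def weighted_huber(X, y, w, delta=1.0, n_iter=200, tol=1e-10):
    """Minimise sum_i w_i * huber_delta(y_i - x_i^T beta) by
    iteratively reweighted least squares (IRLS)."""
    beta = np.zeros(X.shape[1])
    for _ in range(n_iter):
        r = y - X @ beta
        # IRLS factor psi(r)/r: 1 in the quadratic zone, delta/|r| in the linear zone.
        s = w * np.minimum(1.0, delta / np.maximum(np.abs(r), 1e-12))
        A = X.T @ (s[:, None] * X)
        b = X.T @ (s * y)
        beta_new = np.linalg.solve(A, b)
        converged = np.linalg.norm(beta_new - beta) < tol
        beta = beta_new
        if converged:
            break
    return beta


# Simulated data (all values below are illustrative assumptions).
rng = np.random.default_rng(0)
n, d, n_out = 500, 5, 25
beta_true = np.arange(1.0, d + 1.0)
X = rng.standard_normal((n, d))
y = X @ beta_true + rng.standard_t(df=3, size=n)  # heavy-tailed noise

# Crude contamination: overwrite the first n_out samples with gross outliers.
X[:n_out] *= 50.0
y[:n_out] = 1000.0

w = trimming_weights(X, eps=0.05)          # step 1: weights from the covariates
beta_hat = weighted_huber(X, y, w)         # step 2: weighted Huber regression
err = np.linalg.norm(beta_hat - beta_true)
print("estimation error:", err)
```

Norm-based trimming only catches outliers that are large in magnitude. An adversary can place contaminated covariates so that they are not extreme in norm, which is why a more careful preprocessing step is needed to obtain a guarantee of the kind described above.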