Efficient private SCO for heavy-tailed data via averaged clipping

Chenhan Jin,Kaiwen Zhou,Bo Han,James Cheng,Tieyong Zeng
DOI: https://doi.org/10.1007/s10994-024-06617-9
IF: 5.414
2024-10-02
Machine Learning
Abstract:We consider stochastic convex optimization for heavy-tailed data with the guarantee of being differentially private (DP). Most prior works on differentially private stochastic convex optimization for heavy-tailed data are either restricted to gradient descent (GD) or performed multi-times clipping on stochastic gradient descent (SGD), which is inefficient for large-scale problems. In this paper, we consider a one-time clipping strategy and provide principled analyses of its bias and private mean estimation. We establish new convergence results and improved complexity bounds for the proposed algorithm called AClipped-dpSGD for constrained and unconstrained convex problems. We also extend our convergent analysis to the strongly convex case and non-smooth case (which works for generalized smooth objectives with H lder-continuous gradients). All the above results are guaranteed with a high probability for heavy-tailed data. Numerical experiments are conducted to justify the theoretical improvement.
computer science, artificial intelligence
What problem does this paper attempt to address?