Fast convergence of sample-average approximation for saddle-point problems

Vitali Pirau
2023-12-13
Abstract:Stochastic saddle point (SSP) problems are, in general, less studied compared to stochastic minimization problems. However, SSP problems emerge from machine learning (adversarial training, e.g., GAN, AUC maximization), statistics (robust estimation), and game theory (stochastic matrix games). Notwithstanding the existing results on the convergence of stochastic gradient algorithms, there is little analysis on the generalization, i.e., how learned on train data models behave with new data samples. In contrast to existing $\mathcal{O}(\frac{1}{n})$ in expectation result <a class="link-https" data-arxiv-id="2006.02067" href="https://arxiv.org/abs/2006.02067">arXiv:2006.02067</a>, <a class="link-https" data-arxiv-id="2010.12561" href="https://arxiv.org/abs/2010.12561">arXiv:2010.12561</a> and $\mathcal{O}(\frac{\log(\frac{1}{\delta})}{\sqrt{n}})$ results with high probability <a class="link-https" data-arxiv-id="2105.03793" href="https://arxiv.org/abs/2105.03793">arXiv:2105.03793</a> , we present $\mathcal{O}(\frac{\log(\frac{1}{\delta})}{n})$ result in high probability for strongly convex-strongly concave setting. The main idea is local norms analysis. We illustrate our results in a matrix-game problem.
Optimization and Control
What problem does this paper attempt to address?