Abstract: In the Wishart model for sparse PCA we are given $n$ samples $Y_1,\ldots, Y_n$ drawn independently from a $d$-dimensional Gaussian distribution $N(0, \mathrm{Id} + \beta vv^\top)$, where $\beta > 0$ and $v\in \mathbb{R}^d$ is a $k$-sparse unit vector, and we wish to recover $v$ (up to sign). We show that if $n \ge \Omega(d)$, then for every $t \ll k$ there exists an algorithm running in time $n\cdot d^{O(t)}$ that solves this problem as long as \[ \beta \gtrsim \frac{k}{\sqrt{nt}}\sqrt{\ln(2 + td/k^2)}\,. \] Prior to this work, the best polynomial time algorithm in the regime $k\approx \sqrt{d}$, called \emph{Covariance Thresholding} (proposed in [KNV15a] and analyzed in [DM14]), required $\beta \gtrsim \frac{k}{\sqrt{n}}\sqrt{\ln(2 + d/k^2)}$. For a large enough constant $t$, our algorithm runs in polynomial time and has better guarantees than Covariance Thresholding. Previously known algorithms with such guarantees required quasi-polynomial time $d^{O(\log d)}$. In addition, we show that our techniques work for sparse PCA with adversarial perturbations, studied in [dKNS20]. This model generalizes not only sparse PCA, but also other problems studied in prior works, including the sparse planted vector problem. As a consequence, we provide polynomial time algorithms for the sparse planted vector problem that have better guarantees than the state of the art in some regimes. Our approach also works for the Wigner model for sparse PCA. Moreover, we show that it is possible to combine our techniques with recent results on sparse PCA with symmetric heavy-tailed noise [dNNS22]. In particular, in the regime $k \approx \sqrt{d}$ we get the first polynomial time algorithm that works with symmetric heavy-tailed noise, while the algorithm from [dNNS22] requires quasi-polynomial time in these settings.
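To make the model and the signal-strength condition concrete, here is a minimal Python sketch. It is not the paper's algorithm: it only draws samples from $N(0, \mathrm{Id} + \beta vv^\top)$ and evaluates the right-hand side of the bound above with the hidden constant dropped. The function names (`sparse_unit_vector`, `sample_wishart_model`, `beta_threshold`) and the choice of $\pm 1/\sqrt{k}$ entries for $v$ are assumptions made for illustration; the abstract only requires $v$ to be a $k$-sparse unit vector.

```python
import math
import random


def sparse_unit_vector(d, k, rng):
    """Return a k-sparse unit vector in R^d.

    Assumption for illustration: nonzero entries are +-1/sqrt(k) on a
    uniformly random support of size k.
    """
    v = [0.0] * d
    for i in rng.sample(range(d), k):
        v[i] = rng.choice((-1.0, 1.0)) / math.sqrt(k)
    return v


def sample_wishart_model(n, beta, v, rng):
    """Draw n independent samples from N(0, Id + beta * v v^T).

    Each sample is Y = g + sqrt(beta) * z * v with g ~ N(0, Id) and
    z ~ N(0, 1) independent, which has covariance Id + beta * v v^T.
    """
    samples = []
    for _ in range(n):
        z = rng.gauss(0.0, 1.0)
        samples.append([rng.gauss(0.0, 1.0) + math.sqrt(beta) * z * vi for vi in v])
    return samples


def beta_threshold(n, d, k, t=1):
    """Evaluate (k / sqrt(n t)) * sqrt(ln(2 + t d / k^2)).

    The constant hidden by the "gtrsim" in the abstract is omitted, so only
    ratios between different values of t are meaningful.
    """
    return k / math.sqrt(n * t) * math.sqrt(math.log(2 + t * d / k**2))


if __name__ == "__main__":
    rng = random.Random(0)
    v = sparse_unit_vector(d=400, k=20, rng=rng)
    Y = sample_wishart_model(n=50, beta=2.0, v=v, rng=rng)
    # Ratio of the bound at t = 4 to the bound at t = 1, up to constants.
    print(beta_threshold(400, 400, 20, t=4) / beta_threshold(400, 400, 20, t=1))
```

With `t=1` the expression in `beta_threshold` coincides with the Covariance Thresholding requirement quoted above, so comparing values at larger `t` gives a rough sense of the improvement, up to the omitted constants.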

Online Learning for Sparse PCA in High Dimensions: Exact Dynamics and Phase Transitions

Stochastic gradient descent in high dimensions for multi-spiked tensor PCA

Large-Dimensional Positive Definite Covariance Estimation for High Frequency Data via Low-rank and Sparse Matrix Decomposition

Dynamic Principal Subspaces with Sparsity in High Dimensions

Biologically Plausible Online Principal Component Analysis Without Recurrent Neural Dynamics

Online Kernel Learning with a Near Optimal Sparsity Bound

Gradient-based sparse principal component analysis with extensions to online learning

Phase Retrieval Using Iterative Projections: Dynamics in the Large Systems Limit

Dynamic Principal Subspaces in High Dimensions

Diffusion approximations of Oja's online principal component analysis

Algorithmic thresholds for tensor PCA

Orthogonal Sparse PCA and Covariance Estimation via Procrustes Reformulation

Convergence of Oja's online principal component flow

Schrödinger PCA: On the Duality between Principal Component Analysis and Schrödinger Equation

Nearly optimal stochastic approximation for online principal subspace estimation

Sparse PCA Beyond Covariance Thresholding

On the optimality of the Oja's algorithm for online PCA

An Acceleration Scheme for Memory Limited, Streaming PCA

Oja's Algorithm for Streaming Sparse PCA

Dynamic Principal Component Analysis in High Dimensions

Optimal Differentially Private PCA and Estimation for Spiked Covariance Matrices