High Dimension Low Sample Size Asymptotics of Robust PCA

Yi-Hui Zhou,J. S. Marron
DOI: https://doi.org/10.1214/15-ejs992
2015-01-01
Electronic Journal of Statistics
Abstract:Conventional principal component analysis is highly susceptible to outliers. In particular, a sufficiently outlying single data point, can draw the leading principal component toward itself. In this paper, we study the effects of outliers for high dimension and low sample size data, using asymptotics. The non-robust nature of conventional principal component analysis is verified through inconsistency under multivariate Gaussian assumptions with a single spike in the covariance structure, in the presence of a contaminating outlier. In the same setting, the robust method of spherical principal components is consistent with the population eigenvector for the spike model, even in the presence of contamination.
What problem does this paper attempt to address?