The Asymptotic Properties of the Extreme Eigenvectors of High-dimensional Generalized Spiked Covariance Model

Zhangni Pu,Xiaozhuo Zhang,Jiang Hu,Zhidong Bai
2024-05-14
Abstract:In this paper, we investigate the asymptotic behaviors of the extreme eigenvectors in a general spiked covariance matrix, where the dimension and sample size increase proportionally. We eliminate the restrictive assumption of the block diagonal structure in the population covariance matrix. Moreover, there is no requirement for the spiked eigenvalues and the 4th moment to be bounded. Specifically, we apply random matrix theory to derive the convergence and limiting distributions of certain projections of the extreme eigenvectors in a large sample covariance matrix within a generalized spiked population model. Furthermore, our techniques are robust and effective, even when spiked eigenvalues differ significantly in magnitude from nonspiked ones. Finally, we propose a powerful statistic for hypothesis testing for the eigenspaces of covariance matrices.
Statistics Theory
What problem does this paper attempt to address?
The problem this paper attempts to address is the asymptotic properties of extreme eigenvectors in high-dimensional generalized spiked covariance matrices. Specifically, the authors investigate the asymptotic behavior of extreme eigenvectors in generalized spiked covariance matrices as the sample size and dimension increase proportionally. Compared to previous studies, this paper removes the restrictive assumption of block diagonal structure in the population covariance matrix and does not require bounded spiked eigenvalues and fourth moments. The main contributions include: 1. **Relaxing assumptions**: Only conditions matching up to the second moment and tail probability \( P(|X| \geq x) = o(x^{-4}) \) are required, which is a necessary and sufficient condition for the weak convergence of the largest eigenvalue. 2. **Extending the model**: Considering a general non-negative definite matrix \( \Sigma = TT^* \), where \( T \) undergoes singular value decomposition, removing the assumption of diagonal block independence. 3. **Handling extreme cases**: Investigating cases where spiked eigenvalues exceed or fall below the relevant threshold, without requiring bounded spiked eigenvalues. With these improvements, the authors are able to more comprehensively understand and analyze complex systems in high-dimensional data, especially when dealing with spiked eigenvalues and eigenvectors. These results are significant for fields such as high-dimensional statistical inference, signal detection, and machine learning.