Spectral properties of high dimensional rescaled sample correlation matrices

Weijiang Chen,Shurong Zheng,Tingting Zou
2024-08-29
Abstract:High-dimensional sample correlation matrices are a crucial class of random matrices in multivariate statistical analysis. The central limit theorem (CLT) provides a theoretical foundation for statistical inference. In this paper, assuming that the data dimension increases proportionally with the sample size, we derive the limiting spectral distribution of the matrix $\widehat{\mathbf{R}}_n\mathbf{M}$ and establish the CLTs for the linear spectral statistics (LSS) of $\widehat{\mathbf{R}}_n\mathbf{M}$ in two structures: linear independent component structure and elliptical structure. In contrast to existing literature, our proposed spectral properties do not require $\mathbf{M}$ to be an identity matrix. Moreover, we also derive the joint limiting distribution of LSSs of $\widehat{\mathbf{R}}_n \mathbf{M}_1,\ldots,\widehat{\mathbf{R}}_n \mathbf{M}_K$. As an illustration, an application is given for the CLT.
Statistics Theory
What problem does this paper attempt to address?
### Problems Addressed by the Paper This paper primarily investigates the spectral properties of high-dimensional sample correlation matrices and establishes the Central Limit Theorem (CLT) for Linear Spectral Statistics (LSS) based on these properties. Specifically: 1. **Spectral Distribution**: The paper first derives the Limiting Spectral Distribution (LSD) of the high-dimensional sample correlation matrix \( \boldsymbol{R}_nM \), where \( M \) is a predetermined positive definite matrix. 2. **Central Limit Theorem**: - Under the elliptical structure assumption, the CLT for the LSS of \( \boldsymbol{R}_nM \) is established. - Under the independent component structure assumption, the CLT for the LSS of \( \boldsymbol{R}_nM \) is also established. - Furthermore, the paper derives the joint limiting distribution of the LSS for multiple high-dimensional sample correlation matrices \( \boldsymbol{R}_{nM_1}, \ldots, \boldsymbol{R}_{nM_K} \). 3. **Applications**: The paper proposes a testing method based on the difference and ratio between the sample correlation matrix and a given matrix \( R_0 \), to test whether the population correlation matrix equals the given matrix \( R_0 \). Through these theoretical results, the paper provides important statistical tools for high-dimensional data analysis and demonstrates their effectiveness in practical applications.