Does Principal Component Analysis Preserve the Sparsity in Sparse Weak Factor Models?

Jie Wei,Yonghui Zhang
DOI: https://doi.org/10.13140/RG.2.2.23601.04965
2023-05-10
Abstract:This paper studies the principal component (PC) method-based estimation of weak factor models with sparse loadings. We uncover an intrinsic near-sparsity preservation property for the PC estimators of loadings, which comes from the approximately upper triangular (block) structure of the rotation matrix. It implies an asymmetric relationship among factors: the rotated loadings for a stronger factor can be contaminated by those from a weaker one, but the loadings for a weaker factor is almost free of the impact of those from a stronger one. More importantly, the finding implies that there is no need to use complicated penalties to sparsify the loading estimators. Instead, we adopt a simple screening method to recover the sparsity and construct estimators for various factor strengths. In addition, for sparse weak factor models, we provide a singular value thresholding-based approach to determine the number of factors and establish uniform convergence rates for PC estimators, which complement Bai and Ng (2023). The accuracy and efficiency of the proposed estimators are investigated via Monte Carlo simulations. The application to the FRED-QD dataset reveals the underlying factor strengths and loading sparsity as well as their dynamic features.
Econometrics
What problem does this paper attempt to address?
This paper aims to solve the problem of whether the principal component analysis (PCA) method can maintain sparsity when estimating sparse weak factor models. Specifically, the paper explores the performance of the PCA method in dealing with weak factor models with sparse loadings and proposes a new method to recover the sparsity of loadings through a simple screening method. ### Specific problems that the paper attempts to solve include: 1. **Sparsity Preservation**: - Research whether the PCA estimator can preserve its sparsity when estimating the loadings of weak factor models. The paper finds that the PCA estimator has an inherent approximate sparsity - preserving property, which stems from the approximately upper - triangular (block) structure of the rotation matrix. - This property implies an asymmetric relationship between factors: the rotated loadings of stronger factors may be affected by weaker factors, but the loadings of weaker factors are hardly affected by stronger factors. 2. **No Need for Complex Penalties**: - Since the PCA estimator can naturally preserve the sparsity of loadings, there is no need to use complex penalty methods (such as ℓ1 regularization) to sparsify the loading estimator. Instead, a simple screening method can be used to recover sparsity and construct estimators of different factor strengths. 3. **Determination of the Number of Factors**: - For sparse weak factor models, the paper proposes a method based on singular value thresholding (SVT) to determine the number of factors and establishes the consistent convergence rate of the PC estimator, supplementing the research of Bai and Ng (2023). 4. **Theoretical Properties**: - Under the sparsity assumption, the consistency and asymptotic distribution of the PCA estimator for factors, loadings, and common components are proved, and their consistent convergence rates are established. - It is found that the rotation matrix has a special structure in PCA estimation, resulting in the PCA estimation of loadings being close to sparse. Based on this finding, the loading estimator is further precisely sparsified by screening the estimated PCA loadings. 5. **Practical Applications**: - The accuracy and efficiency of the proposed estimator are verified through Monte Carlo simulations, and the method is applied to the FRED - QD data set, revealing the potential factor strength and loading sparsity and their dynamic characteristics. ### Summary: The main contribution of this paper lies in revealing the sparsity - preserving property of the PCA estimator in sparse weak factor models and proposing a simple and effective screening method to recover the sparsity of loadings. In addition, the paper also provides a new method for determining the number of factors and establishes the consistency and convergence rate of related estimators, providing a new perspective and tool for the research of sparse weak factor models.