Testing the number of common factors by bootstrapped sample covariance matrix in high-dimensional factor models

Long Yu,Peng Zhao,Wang Zhou
2023-11-20
Abstract:This paper studies the impact of bootstrap procedure on the eigenvalue distributions of the sample covariance matrix under a high-dimensional factor structure. We provide asymptotic distributions for the top eigenvalues of bootstrapped sample covariance matrix under mild conditions. After bootstrap, the spiked eigenvalues which are driven by common factors will converge weakly to Gaussian limits after proper scaling and centralization. However, the largest non-spiked eigenvalue is mainly determined by the order statistics of the bootstrap resampling weights, and follows extreme value distribution. Based on the disparate behavior of the spiked and non-spiked eigenvalues, we propose innovative methods to test the number of common factors. Indicated by extensive numerical and empirical studies, the proposed methods perform reliably and convincingly under the existence of both weak factors and cross-sectionally correlated errors. Our technical details contribute to random matrix theory on spiked covariance model with convexly decaying density and unbounded support, or with general elliptical distributions.
Statistics Theory,Probability,Methodology
What problem does this paper attempt to address?
### What problem does this paper attempt to solve? This paper primarily investigates the issue of estimating the number of common factors in high-dimensional factor models by using the bootstrap sample covariance matrix. #### Main Contributions: 1. **Theoretical Contributions**: - It studies the impact of the bootstrap process on the distribution of eigenvalues of the sample covariance matrix and provides the asymptotic distribution of the top eigenvalues under mild conditions. - For the spiked eigenvalues after bootstrapping, these eigenvalues weakly converge to a Gaussian limit after appropriate scaling and centering. - For non-spiked eigenvalues, they are mainly determined by the order statistics of the bootstrap resampling weights and follow an extreme value distribution. 2. **Methodological Innovations**: - It proposes new test-based methods to determine the number of common factors, which perform reliably in the presence of weak factors and cross-sectional correlated errors. - It contributes to random matrix theory, especially for spiked covariance models with convex decaying density and unbounded support or general elliptical distributions. #### Specific Issues: - **Determining the Number of Factors in High-Dimensional Factor Models**: Determining the number of common factors is a fundamental step in factor analysis. In fields like finance and econometrics, it remains an open question whether a new factor increases the explanatory power of asset pricing. - **Limitations of Traditional Methods**: Existing methods are mostly based on the different growth rates of factor and noise eigenvalues, but these methods may fail when the factors are weak. - **Application of Bootstrap Techniques**: Using bootstrap techniques to approximate the asymptotic distribution of sample eigenvalues, thereby improving the estimation of the number of factors. #### Methodology: - **Bootstrap Sample Covariance Matrix**: Constructing the bootstrap sample covariance matrix by resampling the original data and studying the asymptotic behavior of its eigenvalues. - **Distinguishing Spiked and Non-Spiked Eigenvalues**: Spiked eigenvalues are driven by common factors, while non-spiked eigenvalues are driven by idiosyncratic errors. - **Asymptotic Distribution**: Through appropriate scaling and centering, spiked eigenvalues converge to a Gaussian limit, while non-spiked eigenvalues converge to an extreme value distribution. #### Practical Applications: - **New Testing Methods**: Proposing new test-based methods to determine the number of common factors, which perform reliably in the presence of weak factors and cross-sectional correlated errors. - **Empirical Validation**: Extensive numerical and empirical studies validate the effectiveness and reliability of the proposed methods. In summary, this paper aims to improve the estimation methods for the number of common factors in high-dimensional factor models by using bootstrap techniques, especially in the presence of weak factors and cross-sectional correlated errors.