Modeling High-Dimensional Time Series: A Factor Model with Dynamically Dependent Factors and Diverging Eigenvalues

Zhaoxing Gao, Ruey S. Tsay
DOI: https://doi.org/10.1080/01621459.2020.1862668
2020-07-01
Abstract: This article proposes a new approach to modeling high-dimensional time series by treating a $p$-dimensional time series as a nonsingular linear transformation of certain common factors and idiosyncratic components. Unlike the approximate factor models, we assume that the factors capture all the non-trivial dynamics of the data, but the cross-sectional dependence may be explained by both the factors and the idiosyncratic components. Under the proposed model, (a) the factor process is dynamically dependent and the idiosyncratic component is a white noise process, and (b) the largest eigenvalues of the covariance matrix of the idiosyncratic components may diverge to infinity as the dimension $p$ increases. We propose a white noise testing procedure for high-dimensional time series to determine the number of white noise components and, hence, the number of common factors, and introduce a projected Principal Component Analysis (PCA) to eliminate the diverging effect of the idiosyncratic noises. Asymptotic properties of the proposed method are established for both fixed $p$ and diverging $p$ as the sample size $n$ increases to infinity. We use both simulated data and real examples to assess the performance of the proposed method. We also compare our method with two commonly used methods in the literature concerning the forecastability of the extracted factors and find that the proposed approach not only provides interpretable results, but also performs well in out-of-sample forecasting. Supplementary materials of the article are available online.
Methodology
What problem does this paper attempt to address?
This paper addresses three problems in high-dimensional time series modeling:

1. **Dynamic Dependence**: Dynamic dependence in high-dimensional time series is complex. Existing approximate factor models usually assume that the factors capture most of the dynamic dependence in the data, which leaves idiosyncratic components that may still carry some serial dependence. This paper instead assumes that the factors capture all the non-trivial dynamics: the factor process is dynamically dependent and the idiosyncratic component is a white noise process, while cross-sectional dependence may be explained by both the factors and the idiosyncratic components.
2. **Divergence of Eigenvalues**: The largest eigenvalues of the covariance matrix of the idiosyncratic components may diverge to infinity as the dimension \(p\) increases. Methods that do not account for this can produce biased estimates. The paper introduces a projected PCA (Principal Component Analysis) to eliminate the diverging effect of the idiosyncratic noise.
3. **Determining the Number of Factors**: Choosing the number of common factors is difficult in high dimensions, and existing approaches such as ratio-based methods may not be reliable enough. The paper proposes a white noise testing procedure that determines the number of white noise components and, hence, the number of common factors.

The main contributions of the paper are as follows:

- **Model Flexibility**: The observed series is modeled as a nonsingular linear transformation of common factors and idiosyncratic components, so cross-sectional dependence can come from both parts and the idiosyncratic covariance may have diverging eigenvalues.
- **Eliminating the Influence of Idiosyncratic Components**: The projected PCA removes the influence of the idiosyncratic components on the estimated common factors, especially when the eigenvalues diverge.
- **Reliable Estimation of the Number of Factors**: The number of common factors is determined by white noise testing, which is designed to work in high-dimensional settings and to extract the dynamically dependent factors.

According to the abstract, asymptotic properties are established for both fixed and diverging \(p\), and the method gives interpretable results and performs well in out-of-sample forecasting on simulated and real data. Such methods are relevant to applications in fields such as finance, economics, and environmental science.
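To make the projection idea above more concrete, here is a minimal Python sketch of how a projection can remove idiosyncratic noise under the stated model, in which the observed series is a nonsingular linear transformation of factors and white noise. Everything specific in it is an assumption made for illustration and is not taken from the paper: the names `L1`, `L2`, `B1`, `B2`, the AR(1) factors, the dimensions, the inflated noise variance, and above all the use of the true loadings and the population covariance rather than estimates. It is a population-level illustration of the algebra, not the authors' estimator.

```python
import numpy as np

rng = np.random.default_rng(0)
p, r, n, phi = 8, 2, 500, 0.8  # illustrative sizes, not from the paper

# Nonsingular p x p transformation L = [L1, L2], kept close to the identity
# so that it is well conditioned but not orthogonal.
L = np.eye(p) + 0.3 * rng.standard_normal((p, p)) / np.sqrt(p)
L1, L2 = L[:, :r], L[:, r:]

# Factors: independent AR(1) series (dynamically dependent).
f = np.zeros((n, r))
shocks = rng.standard_normal((n, r))
for t in range(1, n):
    f[t] = phi * f[t - 1] + shocks[t]

# Idiosyncratic part: white noise with one high-variance component,
# standing in for a large eigenvalue of the noise covariance.
noise_sd = np.ones(p - r)
noise_sd[0] = 5.0
eps = rng.standard_normal((n, p - r)) * noise_sd

y = f @ L1.T + eps @ L2.T  # rows are y_t = L1 f_t + L2 eps_t

# Population covariance of y_t, using the stationary AR(1) variance.
sigma_y = L1 @ L1.T / (1 - phi**2) + (L2 * noise_sd**2) @ L2.T


def orth_complement(mat):
    """Orthonormal basis orthogonal to the columns of a full-column-rank `mat`."""
    u, _, _ = np.linalg.svd(mat, full_matrices=True)
    return u[:, mat.shape[1]:]


# B1' y_t = B1' L2 eps_t contains no factor signal (assumes L1 is known).
B1 = orth_complement(L1)

# Since B1' L1 = 0, sigma_y @ B1 lies in the column space of L2, so S has
# r zero eigenvalues whose eigenvectors are orthogonal to L2.
S = sigma_y @ B1 @ B1.T @ sigma_y
eigvals, eigvecs = np.linalg.eigh(S)  # eigenvalues in ascending order
B2 = eigvecs[:, :r]

# B2' y_t = B2' L1 f_t: the noise drops out and f_t can be solved for.
f_hat = y @ B2 @ np.linalg.inv(B2.T @ L1).T


def lag1_autocorr(x):
    """Column-wise lag-1 sample autocorrelation."""
    x = x - x.mean(axis=0)
    return (x[1:] * x[:-1]).sum(axis=0) / (x**2).sum(axis=0)


print("max |B2' L2|:", np.abs(B2.T @ L2).max())
print("max recovery error:", np.abs(f_hat - f).max())
print("lag-1 autocorr, recovered factors:", lag1_autocorr(f_hat))
print("lag-1 autocorr, B1' y_t:", lag1_autocorr(y @ B1))
```

In this idealized setting the recovered factors should match the simulated ones up to floating-point error, no matter how large the noise variance is, because `B2` is orthogonal to every noise direction. The projected series `B1' y_t` should show little serial correlation, which is the kind of behavior a white noise test looks for. In practice the loading space and covariance would have to be estimated from data, and only the factor space (not the factors themselves) is identifiable.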