Estimation of Characteristics-based Quantile Factor Models

Liang Chen, Juan Jose Dolado, Jesus Gonzalo, Haozi Pan
DOI: https://doi.org/10.48550/arXiv.2304.13206
2023-04-26
Abstract: This paper studies the estimation of characteristic-based quantile factor models where the factor loadings are unknown functions of observed individual characteristics while the idiosyncratic error terms are subject to conditional quantile restrictions. We propose a three-stage estimation procedure that is easily implementable in practice and has nice properties. The convergence rates, the limiting distributions of the estimated factors and loading functions, and a consistent selection criterion for the number of factors at each quantile are derived under general conditions. The proposed estimation methodology is shown to work satisfactorily when: (i) the idiosyncratic errors have heavy tails, (ii) the time dimension of the panel dataset is not large, and (iii) the number of factors exceeds the number of characteristics. Finite sample simulations and an empirical application aimed at estimating the loading functions of the daily returns of a large panel of S&P 500 index securities help illustrate these properties.
Econometrics
What problem does this paper attempt to address?
### Problems the Paper Attempts to Solve

This paper studies estimation methods for Characteristic-based Quantile Factor Models (CQFM). Specifically, it focuses on how to estimate these models when the factor loadings are unknown functions of observed individual characteristics and the idiosyncratic error terms are subject to conditional quantile restrictions.

### Main Contributions

1. **Proposing a New Three-Stage Estimation Method**: The authors propose a three-stage estimation method called Quantile-Projected Principal Component Analysis (QPPCA). This method is computationally simpler than existing estimation methods and has good properties.
2. **Handling Heavy-Tailed Distribution Data**: The QPPCA method performs well when the idiosyncratic error terms have heavy-tailed distributions, which is particularly important in financial market data analysis, as financial data often exhibit heavy tails.
3. **Allowing the Number of Factors to Vary with Quantiles**: Unlike traditional factor models, CQFM allows the latent factors, loading functions, and the number of factors to vary across quantiles, providing a more comprehensive description of the joint distribution of asset returns.
4. **Consistent Factor Number Selection Criterion**: The paper proposes a consistent selection criterion for the number of factors at each quantile, which works with a reasonable sample size.

### Research Background

Traditional factor modeling follows two main approaches:

- **Approximate Factor Models (AFM)**: Assume that factors are unobserved and need to be jointly estimated with factor loadings. This approach has identification issues because factors and loadings can only be determined up to a rotation matrix.
- **Fama-French Models**: Construct factor proxies based on observable characteristics (such as market capitalization and book-to-market ratio) to explain the cross-sectional comovement of stock returns. The reliability of this approach decreases as the number of factors increases.

### Characteristics-based Factor Models (CFM)

CFM attempts to combine the advantages of the above two approaches by assuming that factor loadings are smooth nonlinear functions of some observable characteristics, while factors remain unobserved. This way, even with a large number of factors, the latent factors can be easily estimated, but their interpretation depends on the chosen observable characteristics.

### Quantile Factor Models (QFM)

QFM is an extension of traditional factor models that replaces mean restrictions with quantile restrictions. CQFM further extends QFM by allowing factor loadings to be functions of observed characteristics and the number of factors to vary with quantiles.

### Methods and Results

1. **Three-Stage Estimation Method**:
   - **First Stage**: Project the observations onto the characteristic space through sieve quantile regression.
   - **Second Stage**: Use the fitted values from the first stage to estimate factors and loadings through Principal Component Analysis (PCA).
   - **Third Stage**: Recover the entire loading function by projecting the estimated loadings onto the basis functions of the sieve space.
2. **Convergence Rates and Asymptotic Distributions**: Under very general conditions, the convergence rates and asymptotic distributions of the QPPCA estimated factors and loading functions are derived.
3. **Factor Number Selection**: A new factor number estimator is proposed, which performs well with a reasonable sample size.

### Empirical Application

The paper validates the effectiveness of the QPPCA method through finite sample simulations and an empirical application (estimating the loading functions of daily returns of S&P 500 index securities). The results show that QPPCA can reveal significant variations in the estimated loading functions across different quantiles, which the traditional PPCA method cannot achieve.
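
A small worked example may help show why loading functions and the number of factors can change with the quantile, while a mean-based method such as PPCA recovers only one set. The location-scale design below is a hypothetical illustration, not a model taken from the paper; the symbols $\lambda$, $\sigma$, $f_t$, $g_t$, $\varepsilon_{it}$ and $Q_\varepsilon$ are assumptions introduced here.

```latex
% Hypothetical location-scale design (illustration only).
% Assume \sigma(x_i) g_t > 0 and \varepsilon_{it} independent of (x_i, f_t, g_t),
% with mean zero and quantile function Q_\varepsilon(\tau).
\begin{align*}
  y_{it} &= \lambda(x_i)\, f_t + \sigma(x_i)\, g_t\, \varepsilon_{it}, \\
  \mathbb{E}\left[ y_{it} \mid x_i, f_t, g_t \right] &= \lambda(x_i)\, f_t, \\
  Q_\tau\left[ y_{it} \mid x_i, f_t, g_t \right]
    &= \lambda(x_i)\, f_t + \sigma(x_i)\, \bigl[ g_t\, Q_\varepsilon(\tau) \bigr].
\end{align*}
```

In this design the conditional mean has a single factor $f_t$ with loading function $\lambda(\cdot)$. The conditional $\tau$-quantile has two factors, $f_t$ and $g_t Q_\varepsilon(\tau)$, with loading functions $\lambda(\cdot)$ and $\sigma(\cdot)$, whenever $Q_\varepsilon(\tau) \neq 0$. If $\varepsilon_{it}$ is symmetric around zero, then $Q_\varepsilon(0.5) = 0$ and the median again has one factor, so both the loading functions and the number of factors depend on $\tau$.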

### Conclusion

This paper addresses the problem of estimating characteristic-based quantile factor models when the data are heavy-tailed and the time dimension is limited, by proposing the QPPCA method. This method not only has good theoretical properties but also performs well in practical applications.
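
The three stages described in the summary can be sketched in Python. This is a minimal illustration under stated assumptions, not the authors' implementation: it uses a single scalar characteristic, a polynomial sieve, an iteratively reweighted least squares (IRLS) approximation to quantile regression in place of a linear-programming solver, and the normalization $F'F/T = I$. All function names (`sieve_basis`, `quantile_fit`, `qppca_sketch`) and the simulated design are hypothetical.

```python
import numpy as np


def sieve_basis(x, J):
    """Polynomial sieve for one scalar characteristic: [1, x, ..., x^J]."""
    return np.vander(np.atleast_1d(x), J + 1, increasing=True)  # (N, J + 1)


def quantile_fit(Phi, y, tau, n_iter=100, eps=1e-6):
    """Approximate quantile regression of y on Phi via IRLS (assumed solver)."""
    b = np.linalg.lstsq(Phi, y, rcond=None)[0]  # least-squares starting value
    for _ in range(n_iter):
        r = y - Phi @ b
        # Check-loss weights: tau for non-negative residuals, 1 - tau otherwise,
        # divided by |r| so that weighted squares mimic the check loss.
        w = np.where(r >= 0, tau, 1.0 - tau) / np.maximum(np.abs(r), eps)
        WPhi = Phi * w[:, None]
        b = np.linalg.solve(Phi.T @ WPhi, WPhi.T @ y)
    return b


def qppca_sketch(Y, x, tau, R, J=3):
    """Three-stage sketch: sieve quantile regression, PCA, loading projection."""
    N, T = Y.shape
    Phi = sieve_basis(x, J)

    # Stage 1: one sieve quantile regression per time period.
    A = np.column_stack([quantile_fit(Phi, Y[:, t], tau) for t in range(T)])
    Y_hat = Phi @ A  # (N, T) fitted conditional quantiles

    # Stage 2: PCA on the fitted values, normalized so that F'F / T = I.
    eigval, eigvec = np.linalg.eigh(Y_hat.T @ Y_hat / (N * T))
    top = np.argsort(eigval)[::-1][:R]
    F = np.sqrt(T) * eigvec[:, top]  # (T, R) estimated factors
    Lam = Y_hat @ F / T              # (N, R) loadings at the observed x

    # Stage 3: project the loadings on the basis to recover the functions.
    B = np.linalg.lstsq(Phi, Lam, rcond=None)[0]  # (J + 1, R)

    def loading_fn(x_new):
        return sieve_basis(np.asarray(x_new, dtype=float), J) @ B

    return F, Lam, loading_fn


# Simulated one-factor panel with heavy-tailed, median-zero errors.
rng = np.random.default_rng(0)
N, T = 200, 50
x = rng.uniform(-1.0, 1.0, size=N)
f_true = rng.normal(size=T)
U = rng.standard_t(df=2, size=(N, T))
Y = np.outer(1.0 + x, f_true) + U

F_hat, Lam_hat, g_hat = qppca_sketch(Y, x, tau=0.5, R=1)
corr = np.corrcoef(F_hat[:, 0], f_true)[0, 1]
print(f"|corr(estimated factor, true factor)| = {abs(corr):.3f}")
```

Factors are identified only up to sign and rotation, so the check compares the absolute correlation with the simulated factor. Calling `qppca_sketch` at several values of `tau` and evaluating the returned `loading_fn` on a grid is one way to inspect how the estimated loading functions change across quantiles.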