Abstract:This paper proposes a novel large-dimensional positive definite covariance estimator for high-frequency data under a general factor model framework. We demonstrate an appealing connection between the proposed estimator and a weighted group least absolute shrinkage and selection operator (LASSO) penalized least-squares estimator. The proposed estimator improves on traditional principal component analysis by allowing for weak factors, whose signal strengths are weak relative to idiosyncratic components. Despite the presence of microstructure noises and asynchronous trading, the proposed estimator achieves guarded positive definiteness without sacrificing the convergence rate. To make our method fully operational, we provide an extended simultaneous alternating direction method of multipliers algorithm to solve the resultant constrained convex minimization problem efficiently. Empirically, we study the monthly high-frequency covariance structure of the stock constituents of the S&P 500 index from 2008 to 2016, using all traded stocks from the NYSE, AMEX, and NASDAQ stock markets to construct the high-frequency Fama-French four and extended eleven economic factors. We further examine the out-of-sample performance of the proposed method through vast portfolio allocations, which deliver significantly reduced out-of-sample portfolio risk and enhanced Sharpe ratios. The success of our method supports the usefulness of machine learning techniques in finance. This paper was accepted by Agostino Capponi, finance. Funding: This work was supported by the Research Grants Council, University Grants Committee [Grants 11500119, 11505522, 11505721, and 21504818] and the National Natural Science Foundation of China (NSFC) Basic Scientific Center Project [Grant 71988101], entitled as “Econometric Modelling and Economic Policy Studies”, as well as NSFC [Grants 71803166 and 72173104]. Supplemental Material: The online appendix and data files are available at https://doi.org/10.1287/mnsc.2022.04138 .

High Dimensional Covariance Matrix Estimation by Penalizing the Matrix-Logarithm Transformed Likelihood

Expectile regression for analyzing heteroscedasticity in high dimension

Large covariance matrix estimation via penalized log-det heuristics

Large-Dimensional Positive Definite Covariance Estimation for High Frequency Data via Low-rank and Sparse Matrix Decomposition

Penalized Sparse Covariance Regression with High Dimensional Covariates

A Regularized High-Dimensional Positive Definite Covariance Estimator with High-Frequency Data

Sparse estimation of a covariance matrix

Inference for High-Dimensional Linear Expectile Regression with De-Biasing Method

A Non-Parametric Shrinkage Mean Estimation for Arbitrary Quadratic Loss Functions and Unknown Covariance Matrices

High Dimensional Covariance Matrix Estimation Using Multi-Factor Models from Incomplete Information

Statistical Inference for Large-dimensional Matrix Factor Model from Least Squares and Huber Loss Points of View

A Dynamic Structure for High Dimensional Covariance Matrices and its Application in Portfolio Allocation

Principal regression for high dimensional covariance matrices

Matrix Factor Analysis: from Least Squares to Iterative Projection

Matrix Kendall's tau in High-dimensions: A Robust Statistic for Matrix Factor Model

Covariance Matrix Analysis for Optimal Portfolio Selection

Entropic covariance models

Covariance Model with General Linear Structure and Divergent Parameters

Blessing of dimension in Bayesian inference on covariance matrices

Optimal Eigenvalue Shrinkage in the Semicircle Limit

The Quadratic Optimization Bias Of Large Covariance Matrices