Abstract:This paper provides a comprehensive estimation framework for large covariance matrices via a log-det heuristics augmented by a nuclear norm plus $l_{1}$ norm penalty. %We develop the model framework, which includes high-dimensional approximate factor models with a sparse residual covariance. The underlying assumptions allow for non-pervasive latent eigenvalues and a prominent residual covariance pattern. We prove that the aforementioned log-det heuristics is locally convex with a Lipschitz-continuous gradient, so that a proximal gradient algorithm may be stated to numerically solve the problem while controlling the threshold parameters. The proposed optimization strategy recovers with high probability both the covariance matrix components and the latent rank and the residual sparsity pattern, and performs systematically not worse than the corresponding estimators employing Frobenius loss in place of the log-det heuristics. The error bounds for the ensuing low rank and sparse covariance matrix estimators are established, and the identifiability condition for the latent geometric manifolds is provided. The validity of outlined results is highlighted by means of an exhaustive simulation study and a real financial data example involving euro zone banks.

What problem does this paper attempt to address?

The problem that this paper attempts to solve is the problem of estimating large - scale covariance matrices in high - dimensional data. Specifically, the paper proposes a log - determinant heuristic method with nuclear norm plus $l_1$ - norm penalty to estimate large - scale covariance matrices. This method aims to solve the problem of inconsistent estimation of traditional sample covariance matrices when the number of variables $p$ is much larger than the sample size $n$. ### Main contributions of the paper 1. **Model framework**: - The paper proposes a model framework that includes a high - dimensional approximate factor model and a sparse residual covariance. - This model allows for non - ubiquitous latent eigenvalues and significant residual covariance patterns. 2. **Optimization strategy**: - It is proved that the log - determinant heuristic method is locally convex and its gradient is Lipschitz continuous. - A proximal gradient algorithm is proposed to numerically solve the problem while controlling the threshold parameters. 3. **Performance guarantee**: - It is proved that the proposed optimization strategy can recover each component of the covariance matrix, the latent rank, and the residual sparse pattern with high probability. - It is shown that this method is no worse in performance than the corresponding estimator using Frobenius loss instead of the log - determinant heuristic method. 4. **Theoretical results**: - Error bounds for low - rank and sparse covariance matrix estimators are established. - Identifiability conditions for the underlying geometric manifold are provided. 5. **Verification and application**: - The effectiveness of the proposed method is verified through extensive simulation studies and a real - world financial data example involving banks in the euro area. ### Formulas and mathematical properties 1. **Model definition**: \[ \Sigma^* = L^* + S^* = BB' + S^* \] where $L^* = BB'$ is a positive semi - definite matrix with rank $r < p$; $S^*$ is a positive definite and element - sparse matrix, containing $s \ll \frac{p(p - 1)}{2}$ non - zero off - diagonal elements. 2. **Optimization problem**: \[ (\hat{L}, \hat{S})=\arg\min_{L, S}(L(L, S)+P(L, S)) \] where $P(L, S)=\psi\|L\|_*+\rho\|S\|_1$, $\|L\|_*=\sum_{i = 1}^p\lambda_i(L)$ is the nuclear norm, $\|S\|_1=\sum_{i = 1}^p\sum_{j = 1}^p|S_{ij}|$ is the $l_1$ - norm, $\psi$ and $\rho$ are non - negative threshold parameters, and $L(L, S)$ is a smooth loss function. 3. **Log - determinant loss**: \[ L_{ld}(L, S)=\frac{1}{2}\log\det(I_p+\Delta_n\Delta_n') \] where $\Delta_n=\Sigma-\Sigma_n$, $\Sigma = L + S$. 4. **Local convexity**: - It is proved that $L_{ld}(L, S)$ is locally convex within a specific range. - The first - order and second - order derivatives of $L_{ld}(L, S)$ and its locally convex region are given. 5. **Lipschitz continuity**: - The Lipschitz continuity of $L_{ld}(L, S)$ and its gradient is proved. ### Conclusion By introducing the log - determinant heuristic method and the nuclear norm plus $l_1$ - norm penalty, the paper provides an effective method for estimating high - dimensional covariance matrices. This method not only has good theoretical properties but also performs well in practical applications. Through strict mathematical derivations and empirical analyses, the paper verifies the effectiveness and reliability of this method.

Large covariance matrix estimation via penalized log-det heuristics

Large-Dimensional Positive Definite Covariance Estimation for High Frequency Data via Low-rank and Sparse Matrix Decomposition

Penalized Sparse Covariance Regression with High Dimensional Covariates

Sparse estimation of a covariance matrix

A Regularized High-Dimensional Positive Definite Covariance Estimator with High-Frequency Data

Covariance Estimation in High Dimensions Via Kronecker Product Expansions

Nonparametric estimation of large covariance matrices with conditional sparsity

Entropic covariance models

Factor-guided estimation of large covariance matrix function with conditional functional sparsity

Estimation of high-dimensional low-rank matrices

Towards a sparse, scalable, and stably positive definite (inverse) covariance estimator

Statistical Inference for Large-dimensional Matrix Factor Model from Least Squares and Huber Loss Points of View

Noisy matrix decomposition via convex relaxation: Optimal rates in high dimensions

Principal regression for high dimensional covariance matrices

Covariance Structure Estimation with Laplace Approximation

Estimating linear covariance models with numerical nonlinear algebra

Adaptive Covariance Estimation with model selection

A multilevel framework for sparse optimization with application to inverse covariance estimation and logistic regression

Heteroskedasticity-robust inference in linear regression models with many covariates

Targeted Fused Ridge Estimation of Inverse Covariance Matrices from Multiple High-Dimensional Data Classes

Mean and variance estimation in high-dimensional heteroscedastic models with non-convex penalties