Large covariance matrix estimation via penalized log-det heuristics

Enrico Bernardi, Matteo Farnè
DOI: https://doi.org/10.48550/arXiv.2209.04867
2022-09-11
Abstract: This paper provides a comprehensive estimation framework for large covariance matrices via a log-det heuristics augmented by a nuclear norm plus $l_{1}$ norm penalty. We develop the model framework, which includes high-dimensional approximate factor models with a sparse residual covariance. The underlying assumptions allow for non-pervasive latent eigenvalues and a prominent residual covariance pattern. We prove that the aforementioned log-det heuristics is locally convex with a Lipschitz-continuous gradient, so that a proximal gradient algorithm may be stated to numerically solve the problem while controlling the threshold parameters. The proposed optimization strategy recovers with high probability both the covariance matrix components and the latent rank and the residual sparsity pattern, and performs systematically not worse than the corresponding estimators employing Frobenius loss in place of the log-det heuristics. The error bounds for the ensuing low rank and sparse covariance matrix estimators are established, and the identifiability condition for the latent geometric manifolds is provided. The validity of outlined results is highlighted by means of an exhaustive simulation study and a real financial data example involving euro zone banks.
Statistics Theory, Methodology
What problem does this paper attempt to address?
The paper addresses the estimation of large covariance matrices from high-dimensional data. Specifically, it proposes a log-determinant heuristic augmented by a nuclear norm plus \(l_1\)-norm penalty. The method targets the setting in which the number of variables \(p\) is much larger than the sample size \(n\), where the traditional sample covariance matrix is an inconsistent estimator.

### Main contributions of the paper

1. **Model framework**:
   - The paper develops a model framework that includes high-dimensional approximate factor models with a sparse residual covariance.
   - The framework allows for non-pervasive latent eigenvalues and a prominent residual covariance pattern.
2. **Optimization strategy**:
   - The log-determinant heuristic is proved to be locally convex with a Lipschitz-continuous gradient.
   - A proximal gradient algorithm is proposed to solve the problem numerically while controlling the threshold parameters.
3. **Performance guarantee**:
   - The proposed optimization strategy is proved to recover, with high probability, the covariance matrix components, the latent rank, and the residual sparsity pattern.
   - The method is shown to perform systematically no worse than the corresponding estimators that use Frobenius loss in place of the log-determinant heuristic.
4. **Theoretical results**:
   - Error bounds for the low-rank and sparse covariance matrix estimators are established.
   - The identifiability condition for the latent geometric manifolds is provided.
5. **Verification and application**:
   - The results are validated through an extensive simulation study and a real financial data example involving euro area banks.

### Formulas and mathematical properties

1. **Model definition**:
   \[ \Sigma^* = L^* + S^* = BB' + S^* \]
   where \(L^* = BB'\) is a positive semi-definite matrix with rank \(r < p\), and \(S^*\) is a positive definite, element-wise sparse matrix with \(s \ll \frac{p(p - 1)}{2}\) non-zero off-diagonal elements.
2. **Optimization problem**:
   \[ (\hat{L}, \hat{S}) = \arg\min_{L, S} \left( \mathcal{L}(L, S) + \mathcal{P}(L, S) \right) \]
   where \(\mathcal{P}(L, S) = \psi\|L\|_* + \rho\|S\|_1\), \(\|L\|_* = \sum_{i = 1}^p \lambda_i(L)\) is the nuclear norm, \(\|S\|_1 = \sum_{i = 1}^p \sum_{j = 1}^p |S_{ij}|\) is the \(l_1\)-norm, \(\psi\) and \(\rho\) are non-negative threshold parameters, and \(\mathcal{L}(L, S)\) is a smooth loss function.
3. **Log-determinant loss**:
   \[ \mathcal{L}_{ld}(L, S) = \frac{1}{2}\log\det(I_p + \Delta_n\Delta_n') \]
   where \(\Delta_n = \Sigma - \Sigma_n\), \(\Sigma = L + S\), and \(\Sigma_n\) is the sample covariance matrix.
4. **Local convexity**:
   - \(\mathcal{L}_{ld}(L, S)\) is proved to be locally convex within a specific range.
   - The first-order and second-order derivatives of \(\mathcal{L}_{ld}(L, S)\) and its region of local convexity are given.
5. **Lipschitz continuity**:
   - The Lipschitz continuity of \(\mathcal{L}_{ld}(L, S)\) and of its gradient is proved.

### Conclusion

By combining the log-determinant heuristic with the nuclear norm plus \(l_1\)-norm penalty, the paper provides a method for estimating high-dimensional covariance matrices that comes with theoretical guarantees (local convexity, error bounds, identifiability) and performs well in the simulation study and the euro area bank data example. Rigorous mathematical derivations and empirical analyses support the effectiveness and reliability of the method.
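
To make the pieces above concrete, here is a minimal NumPy sketch of the log-determinant loss, the two proximal maps that a nuclear norm plus \(l_1\) penalty induces, and a plain proximal gradient loop. It is an illustration under stated assumptions, not the authors' algorithm: the function names, the initialization, the fixed step size, and the iteration count are all hypothetical. The gradient used, \((I_p + \Delta_n\Delta_n')^{-1}\Delta_n\), follows from differentiating \(\frac{1}{2}\log\det(I_p + \Delta_n\Delta_n')\) with respect to \(\Sigma\). Soft-thresholding is applied to every entry of \(S\) to match the \(l_1\)-norm as written above.

```python
import numpy as np


def logdet_loss(L, S, sigma_n):
    """0.5 * log det(I_p + Delta Delta') with Delta = (L + S) - sigma_n."""
    delta = L + S - sigma_n
    p = delta.shape[0]
    _, logdet = np.linalg.slogdet(np.eye(p) + delta @ delta.T)
    return 0.5 * logdet


def logdet_grad(L, S, sigma_n):
    """Gradient of the loss w.r.t. Sigma = L + S: (I_p + Delta Delta')^{-1} Delta."""
    delta = L + S - sigma_n
    p = delta.shape[0]
    return np.linalg.solve(np.eye(p) + delta @ delta.T, delta)


def prox_nuclear(M, tau):
    """Eigenvalue soft-thresholding of a symmetric matrix; the output is PSD."""
    M = (M + M.T) / 2.0                      # guard against rounding asymmetry
    w, V = np.linalg.eigh(M)
    w = np.maximum(w - tau, 0.0)             # shrink eigenvalues, drop negatives
    return (V * w) @ V.T


def prox_l1(M, tau):
    """Entry-wise soft-thresholding (proximal map of tau * ||M||_1)."""
    return np.sign(M) * np.maximum(np.abs(M) - tau, 0.0)


def penalized_objective(L, S, sigma_n, psi, rho):
    """Loss plus psi * nuclear norm of L plus rho * l1 norm of S."""
    nuclear = np.linalg.svd(L, compute_uv=False).sum()
    return logdet_loss(L, S, sigma_n) + psi * nuclear + rho * np.abs(S).sum()


def proximal_gradient(sigma_n, psi, rho, step=0.5, n_iter=200):
    """Hypothetical fixed-step proximal gradient loop (illustrative only)."""
    p = sigma_n.shape[0]
    L = np.zeros((p, p))                     # assumed start: no low-rank part
    S = np.diag(np.diag(sigma_n))            # assumed start: diagonal residual
    for _ in range(n_iter):
        G = logdet_grad(L, S, sigma_n)       # same gradient for L and S
        L_new = prox_nuclear(L - step * G, step * psi)
        S_new = prox_l1(S - step * G, step * rho)
        L, S = L_new, S_new
    return L, S
```

Because \(\Sigma = L + S\) enters the loss only through the sum, the gradient with respect to \(L\) and with respect to \(S\) is the same matrix, which is why a single `G` drives both updates. The step size here is arbitrary; in practice it would need to respect the Lipschitz constant of the gradient and the region of local convexity discussed above. Calling `proximal_gradient(sigma_n, psi, rho)` on a sample covariance matrix returns a candidate low-rank and sparse pair whose penalized objective can be inspected with `penalized_objective`.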