High Dimensional Covariance Matrix Estimation by Penalizing the Matrix-Logarithm Transformed Likelihood

Philip L. H. Yu,Xiaohang Wang,Yuanyuan Zhu
DOI: https://doi.org/10.1016/j.csda.2017.04.004
IF: 2.035
2017-01-01
Computational Statistics & Data Analysis
Abstract:It is well known that when the dimension of the data becomes very large, the sample covariance matrix S will not be a good estimator of the population covariance matrix Σ. Using such estimator, one typical consequence is that the estimated eigenvalues from S will be distorted. Many existing methods tried to solve the problem, and examples of which include regularizing Σ by thresholding or banding. In this paper, we estimate Σ by maximizing the likelihood using a new penalization on the matrix logarithm of Σ (denoted by A) of the form: ‖A−mI‖F2=∑i(log(di)−m)2, where di is the ith eigenvalue of Σ. This penalty aims at shrinking the estimated eigenvalues of A toward the mean eigenvalue m. The merits of our method are that it guarantees Σ to be non-negative definite and is computational efficient. The simulation study and applications on portfolio optimization and classification of genomic data show that the proposed method outperforms existing methods.
What problem does this paper attempt to address?