Fast Projected Newton-like Method for Precision Matrix Estimation under Total Positivity

Jian-Feng Cai, José Vinícius de M. Cardoso, Daniel P. Palomar, Jiaxi Ying
2023-10-23
Abstract: We study the problem of estimating precision matrices in Gaussian distributions that are multivariate totally positive of order two ($\mathrm{MTP}_2$). The precision matrix in such a distribution is an M-matrix. This problem can be formulated as a sign-constrained log-determinant program. Current algorithms are designed using the block coordinate descent method or the proximal point algorithm, which become computationally challenging in high-dimensional cases due to the requirement to solve numerous nonnegative quadratic programs or large-scale linear systems. To address this issue, we propose a novel algorithm based on the two-metric projection method, incorporating a carefully designed search direction and variable partitioning scheme. Our algorithm substantially reduces computational complexity, and its theoretical convergence is established. Experimental results on synthetic and real-world datasets demonstrate that our proposed algorithm provides a significant improvement in computational efficiency compared to the state-of-the-art methods.
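For orientation, the sign-constrained log-determinant program mentioned in the abstract is commonly written in the form below. The symbols are assumptions introduced here, not taken from the abstract: $S$ is the sample covariance matrix and $\Theta$ is the precision matrix being estimated. The paper's exact formulation may include additional terms (for example, a regularizer), so this should be read as the basic maximum-likelihood form only.

$$
\min_{\Theta \succ 0} \; -\log\det(\Theta) + \operatorname{tr}(\Theta S)
\quad \text{subject to} \quad \Theta_{ij} \le 0 \;\; \text{for all } i \ne j .
$$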
Machine Learning
What problem does this paper attempt to address?
This paper addresses the problem of estimating the precision matrix (i.e., the inverse covariance matrix) of a Gaussian distribution that is multivariate totally positive of order two ($\mathrm{MTP}_2$) in high-dimensional settings. Specifically, it focuses on estimating symmetric M-matrices, whose off-diagonal elements are all nonpositive. The problem can be formulated as a sign-constrained log-determinant program. Existing algorithms such as the block coordinate descent (BCD) method and the proximal point algorithm (PPA) become expensive in high dimensions because they must solve a large number of nonnegative quadratic programs or large-scale linear systems. To address this, the authors propose a fast projected Newton-like algorithm based on the two-metric projection method that substantially reduces computational complexity and speeds up convergence.

### Specific Problem Description

1. **Problem Background**:
   - Precision matrix estimation is a central problem for Gaussian distributions, especially under the $\mathrm{MTP}_2$ constraint.
   - Under the $\mathrm{MTP}_2$ constraint, the precision matrix is an M-matrix, so all of its off-diagonal elements are nonpositive.
   - The problem has applications in many fields, such as classification inference, graph signal processing, factor analysis in psychometrics, and financial markets.
2. **Limitations of Existing Methods**:
   - Existing methods such as BCD and PPA perform well in low-dimensional cases, but in high-dimensional cases they are computationally expensive and slow because they must solve a large number of nonnegative quadratic programs or large-scale linear systems.
   - The gradient projection method is cheap per iteration, but it converges slowly in high-dimensional cases.
3. **Research Objectives**:
   - Propose an efficient and scalable algorithm for estimating the precision matrix under the $\mathrm{MTP}_2$ constraint.
   - Use the two-metric projection method, with a purpose-built search direction and variable partitioning scheme, to avoid solving nonnegative quadratic programs or linear systems and thereby substantially reduce computational complexity.

### Main Contributions of the Paper

1. **Algorithm Design**:
   - Propose a fast projected Newton-like algorithm based on the two-metric projection method.
   - Use a carefully designed search direction and variable partitioning scheme to avoid solving nonnegative quadratic programs or linear systems, substantially reducing computational complexity.
   - The per-iteration computational complexity of the algorithm is the same as that of the gradient projection method.
2. **Theoretical Analysis**:
   - Prove that the algorithm converges to the minimizer of the problem.
   - Under mild assumptions, prove that the set of free variables converges to the support of the minimizer within a finite number of iterations, and provide the convergence rate of the algorithm.
3. **Experimental Results**:
   - Experiments on synthetic and real-world datasets show that the proposed algorithm is significantly more computationally efficient than existing methods.
   - When applied to financial time-series data, it shows a significant improvement in network modularity.

Through these contributions, the paper addresses the high computational complexity and slow convergence of existing methods in high-dimensional cases and provides a more efficient and scalable method for precision matrix estimation.
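
To make the gradient projection baseline mentioned above concrete, here is a minimal Python sketch. It is not the paper's projected Newton-like algorithm. It is plain projected gradient descent with a backtracking (Armijo) line search, applied to the basic objective $-\log\det(\Theta) + \operatorname{tr}(\Theta S)$ with nonpositive off-diagonal entries. The function names, the diagonal starting point, the tolerances, and the symbols `S` (sample covariance) and `theta` (precision matrix) are all assumptions chosen for illustration.

```python
import numpy as np


def objective(theta, S):
    """-log det(theta) + tr(theta S); +inf if theta is not positive definite."""
    sign, logdet = np.linalg.slogdet(theta)
    return np.inf if sign <= 0 else -logdet + np.trace(theta @ S)


def is_positive_definite(theta):
    """Check positive definiteness via a Cholesky factorization attempt."""
    try:
        np.linalg.cholesky(theta)
        return True
    except np.linalg.LinAlgError:
        return False


def project_offdiag_nonpositive(theta):
    """Clip off-diagonal entries to be <= 0 and leave the diagonal untouched."""
    out = np.minimum(theta, 0.0)
    np.fill_diagonal(out, np.diag(theta))
    return out


def projected_gradient_mtp2(S, max_iter=500, tol=1e-8, armijo=1e-4):
    """Baseline projected gradient sketch (illustrative, not the paper's method)."""
    theta = np.diag(1.0 / np.diag(S))  # assumed start: diagonal, hence feasible
    f = objective(theta, S)
    for _ in range(max_iter):
        grad = S - np.linalg.inv(theta)  # gradient of the objective
        step = 1.0
        while step > 1e-12:
            cand = project_offdiag_nonpositive(theta - step * grad)
            if is_positive_definite(cand):
                f_cand = objective(cand, S)
                # Armijo sufficient-decrease test along the projection arc
                if f_cand <= f + armijo * np.sum(grad * (cand - theta)):
                    break
            step *= 0.5
        else:
            return theta  # line search failed; return the current iterate
        change = np.linalg.norm(cand - theta)
        theta, f = cand, f_cand
        if change <= tol:
            break
    return theta


if __name__ == "__main__":
    S = np.array([[1.0, 0.5], [0.5, 1.0]])
    print(projected_gradient_mtp2(S))
```

Each iteration of this sketch is dominated by one matrix inverse plus a few Cholesky and determinant evaluations in the line search. A two-metric projection method keeps a similarly simple projection step but rescales the gradient on a subset of the variables, which is where the paper's search direction and variable partitioning scheme come in.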