A new robust covariance matrix estimation for high‐dimensional microbiome data

Jiyang Wang,Wanfeng Liang,Lijie Li,Yue Wu,Xiaoyan Ma
DOI: https://doi.org/10.1111/anzs.12415
2024-05-30
Australian & New Zealand Journal of Statistics
Abstract:Summary Microbiome data typically lie in a high‐dimensional simplex. One of the key questions in metagenomic analysis is to exploit the covariance structure for this kind of data. In this paper, a framework called approximate‐estimate‐threshold (AET) is developed for the robust basis covariance estimation for high‐dimensional microbiome data. To be specific, we first construct a proxy matrix Γ , which is almost indistinguishable from the real basis covariance matrix ∑ . Then, any estimator Γ^ satisfying some conditions can be used to estimate Γ . Finally, we impose a thresholding step on Γ^ to obtain the final estimator ∑^ . In particular, this paper applies a Huber‐type estimator Γ^ , and achieves robustness by only requiring the boundedness of 2+ε moments for some ε∈(0,2] . We derive the convergence rate of ∑^ under the spectral norm, and provide theoretical guarantees on support recovery. Extensive simulations and a real example are used to illustrate the empirical performance of our method.
statistics & probability
What problem does this paper attempt to address?