Identifiability in robust estimation of tree structured models

Marta Casanellas,Marina Garrote-López,Piotr Zwiernik
DOI: https://doi.org/10.3150/22-bej1477
IF: 1.5
2024-02-01
Bernoulli
Abstract:Consider the problem of learning undirected graphical models on trees from corrupted data. Recently Katiyar, Shah, and Caramanis showed that it is possible to recover trees from noisy binary data up to a small equivalence class of possible trees. Another paper by Katiyar, Hoffmann, and Caramanis follows a similar pattern for the Gaussian case. By framing this as a special phylogenetic recovery problem we largely generalize these two settings. Using the framework of linear latent tree models we discuss tree identifiability for binary data under a continuous corruption model (e.g. black/white images with greyscale corruption). For the Ising and the Gaussian tree model we also provide a characterisation of when the Chow-Liu algorithm consistently learns the underlying tree from the noisy data.
statistics & probability
What problem does this paper attempt to address?