Weakest link pruning of a dendrogram

Jiacheng Ge,Robert Tibshirani
DOI: https://doi.org/10.48550/arXiv.2212.05367
2022-12-10
Methodology
Abstract:Hierarchical clustering is a popular method for identifying distinct groups in a dataset. The most commonly used method for pruning a dendrogram is via a single horizontal cut. In this paper, we propose a new technique "weakest link optimal pruning". We prove its superiority over horizontal pruning and provide some examples illustrating how the two methods can behave quite differently.
What problem does this paper attempt to address?