Abstract: The weighted ancestor problem on a rooted node-weighted tree $T$ is a generalization of the classic predecessor problem: construct a data structure for a set of integers that supports fast predecessor queries. Both problems are known to require $\Omega(\log\log n)$ query time when $\mathcal{O}(n\,\text{polylog}\, n)$ space is available, where $n$ is the input size. The weighted ancestor problem has attracted considerable attention from the combinatorial pattern matching community because of its direct application to suffix trees, where the nodes are weighted by string depth. This line of research has culminated in a data structure for weighted ancestors in suffix trees with $\mathcal{O}(1)$ query time and an $\mathcal{O}(n)$-time construction algorithm [Belazzougui et al., CPM 2021]. In this paper, we consider a different version of the weighted ancestor problem, in which the nodes are weighted by any function $\textsf{weight}$ that maps the nodes of $T$ to positive integers such that $\textsf{weight}(u)\le \textsf{size}(u)$ for any node $u$, and $\textsf{weight}(u_1)\le \textsf{weight}(u_2)$ whenever node $u_1$ is a descendant of node $u_2$. Here $\textsf{size}(u)$ denotes the number of nodes in the subtree rooted at $u$. In the size-constrained weighted ancestor (SWA) problem, given any node $u$ of $T$ and any integer $k$, we are asked to return the lowest ancestor $w$ of $u$ with weight at least $k$. We show that for any rooted tree with $n$ nodes, we can locate node $w$ in $\mathcal{O}(1)$ time after $\mathcal{O}(n)$-time preprocessing. In particular, this implies a data structure for the SWA problem in suffix trees with $\mathcal{O}(1)$ query time and $\mathcal{O}(n)$-time preprocessing, when the nodes are weighted by $\textsf{weight}$. We also show several string-processing applications of this result.
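
To make the SWA query concrete, the following is a minimal sketch of a naive baseline that simply walks up the tree from $u$. It is not the $\mathcal{O}(1)$-time data structure announced in the abstract; it only illustrates the problem statement. The names `subtree_sizes` and `swa_naive`, the parent-array representation, the convention that a node counts as its own ancestor, and the choice $\textsf{weight} = \textsf{size}$ in the example are all assumptions made for illustration.

```python
from typing import List, Optional


def subtree_sizes(parent: List[int]) -> List[int]:
    """Return size[u], the number of nodes in the subtree rooted at u.

    Assumed representation: parent[v] is the parent of v, and the root has parent -1.
    """
    n = len(parent)
    children: List[List[int]] = [[] for _ in range(n)]
    root = -1
    for v, p in enumerate(parent):
        if p == -1:
            root = v
        else:
            children[p].append(v)

    # Iterative preorder traversal: every node appears after its parent.
    order: List[int] = []
    stack = [root]
    while stack:
        u = stack.pop()
        order.append(u)
        stack.extend(children[u])

    # Accumulate sizes bottom-up by scanning the preorder in reverse.
    size = [1] * n
    for u in reversed(order):
        if parent[u] != -1:
            size[parent[u]] += size[u]
    return size


def swa_naive(parent: List[int], weight: List[int], u: int, k: int) -> Optional[int]:
    """Return the lowest ancestor w of u with weight[w] >= k, or None if none exists.

    Assumption: u is treated as an ancestor of itself.
    Runs in time proportional to the depth of u, not in constant time.
    """
    w = u
    while w != -1:
        if weight[w] >= k:
            return w
        w = parent[w]
    return None


# Hypothetical example tree: 0 is the root; 1 and 2 are children of 0;
# 3 and 4 are children of 1; 5 is the child of 4.
parent = [-1, 0, 0, 1, 1, 4]
weight = subtree_sizes(parent)  # weight = size satisfies both constraints
answer = swa_naive(parent, weight, 5, 3)
```

In this example the walk from node 5 passes nodes of weight 1 and 2 before reaching node 1, whose weight is 4, so node 1 is returned for $k = 3$. Because weights are non-decreasing towards the root, the first node on the path that meets the threshold is the lowest such ancestor.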

Compact Ancestry Labeling Schemes for Trees of Small Depth

Near-optimal labeling schemes for nearest common ancestors

Branch Code: A Labeling Scheme for Efficient Query Answering on Trees

Nearest Common Ancestors: Universal Trees and Improved Labeling Schemes

Universal rooted phylogenetic tree shapes and universal tanglegrams

Towards a complete perspective on labeled tree indexing: new size bounds, efficient constructions, and beyond

Two Metrics on Rooted Unordered Trees with Labels

A Scalable Method for Readable Tree Layouts

A lattice structure for ancestral configurations arising from the relationship between gene trees and species trees

Size-constrained Weighted Ancestors with Applications

Fully-Functional Static and Dynamic Succinct Trees

Species, Clusters and the 'Tree of Life': A graph-theoretic perspective

On Two Measures of Distance between Fully-Labelled Trees

Computing Rooted and Unrooted Maximum Consistent Supertrees

Exponentially Huge Natural Deduction proofs are Redundant: Preliminary results on $M_\supset$

Enumeration of labeled trees and Dyck tilings

An Improved Lower Bound on the Largest Common Subtree of Random Leaf-Labeled Binary Trees

On the enumeration of leaf-labelled increasing trees with arbitrary node-degree

Edit Distance between Unlabeled Ordered Trees

'Lassoing' a phylogenetic tree I: Basic properties, shellings, and covers

Dynamic "Succincter"