Taxonomy Induction Using Hypernym Subsequences

Amit Gupta, Rémi Lebret, Hamza Harkous, Karl Aberer
DOI: https://doi.org/10.1145/3132847.3133041
2017-11-06
Abstract: We propose a novel, semi-supervised approach towards domain taxonomy induction from an input vocabulary of seed terms. Unlike all previous approaches, which typically extract direct hypernym edges for terms, our approach utilizes a novel probabilistic framework to extract hypernym subsequences. Taxonomy induction from extracted subsequences is cast as an instance of the minimum-cost flow problem on a carefully designed directed graph. Through experiments, we demonstrate that our approach outperforms state-of-the-art taxonomy induction approaches across four languages. Importantly, we also show that our approach is robust to the presence of noise in the input vocabulary. To the best of our knowledge, this robustness has not been empirically proven in any previous approach.