Bi-LSTM-CRF Sequence Labeling for Keyphrase Extraction from Scholarly Documents

Rabah Alzaidy,Cornelia Caragea,C. Lee Giles
DOI: https://doi.org/10.1145/3308558.3313642
2019-01-01
Abstract:In this paper, we address the keyphrase extraction problem as sequence labeling and propose a model that jointly exploits the complementary strengths of Conditional Random Fields that capture label dependencies through a transition parameter matrix consisting of the transition probabilities from one label to the neighboring label, and Bidirectional Long Short Term Memory networks that capture hidden semantics in text through the long distance dependencies. Our results on three datasets of scholarly documents show that the proposed model substantially outperforms strong baselines and previous approaches for keyphrase extraction.
What problem does this paper attempt to address?