Combining Neural and Knowledge-Based Approaches to Named Entity Recognition in Polish

Sławomir Dadas
DOI: https://doi.org/10.1007/978-3-030-20912-4_4
2019-01-01
Abstract: Named entity recognition (NER) is one of the tasks in natural language processing that can greatly benefit from the use of external knowledge sources. We propose a named entity recognition framework composed of knowledge-based feature extractors and a deep learning model that includes contextual word embeddings, long short-term memory (LSTM) layers and a conditional random field (CRF) inference layer. We use an entity linking module to integrate our system with Wikipedia. The combination of an effective neural architecture and external resources allows us to obtain state-of-the-art results on the recognition of Polish proper names. We evaluate our model on data from the PolEval 2018 (http://2018.poleval.pl/) NER challenge, on which it outperforms other methods, reducing the error rate by 22.4% compared to the winning solution.
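To illustrate what a CRF inference layer contributes on top of per-token scores, the following is a minimal sketch of Viterbi decoding in plain Python. It is not the paper's implementation: the function name `viterbi_decode`, the three-label tag set, and all scores are hypothetical values chosen for illustration. The emission scores stand in for the outputs a neural encoder such as the LSTM layers would produce, and the transition scores stand in for learned CRF parameters.

```python
from typing import List, Sequence


def viterbi_decode(
    emissions: Sequence[Sequence[float]],
    transitions: Sequence[Sequence[float]],
    labels: Sequence[str],
) -> List[str]:
    """Return the highest-scoring label sequence.

    emissions[t][j]   -- score of label j at token t (hypothetical encoder output)
    transitions[i][j] -- score of moving from label i to label j
    """
    if not emissions:
        return []
    n = len(labels)
    score = list(emissions[0])  # best score of any path ending in each label
    backptr = []                # backptr[t-1][j] = best previous label for j at t
    for t in range(1, len(emissions)):
        new_score, pointers = [], []
        for j in range(n):
            best_i = max(range(n), key=lambda i: score[i] + transitions[i][j])
            new_score.append(score[best_i] + transitions[best_i][j] + emissions[t][j])
            pointers.append(best_i)
        score = new_score
        backptr.append(pointers)
    # Follow the back-pointers from the best final label.
    best = max(range(n), key=lambda j: score[j])
    path = [best]
    for pointers in reversed(backptr):
        best = pointers[best]
        path.append(best)
    path.reverse()
    return [labels[j] for j in path]


# Hypothetical tag set and scores, for illustration only.
LABELS = ["O", "B-PER", "I-PER"]
# Strongly penalise the invalid transition O -> I-PER; all others are neutral.
TRANSITIONS = [
    [0.0, 0.0, -1e4],  # from O
    [0.0, 0.0, 0.0],   # from B-PER
    [0.0, 0.0, 0.0],   # from I-PER
]
EMISSIONS = [
    [2.0, 0.5, 0.0],   # token 1
    [0.2, 1.0, 1.5],   # token 2
    [0.3, 0.1, 1.2],   # token 3
]

decoded = viterbi_decode(EMISSIONS, TRANSITIONS, LABELS)
print(decoded)
```

Choosing the best label for each token independently would give O, I-PER, I-PER here, which is not a well-formed tag sequence. Scoring whole sequences with transition terms lets the decoder prefer O, B-PER, I-PER instead.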