Multilingual End to End Entity Linking

Mikhail Plekhanov,Nora Kassner,Kashyap Popat,Louis Martin,Simone Merello,Borislav Kozlovskii,Frédéric A. Dreyer,Nicola Cancedda
2023-06-15
Abstract:Entity Linking is one of the most common Natural Language Processing tasks in practical applications, but so far efficient end-to-end solutions with multilingual coverage have been lacking, leading to complex model stacks. To fill this gap, we release and open source BELA, the first fully end-to-end multilingual entity linking model that efficiently detects and links entities in texts in any of 97 languages. We provide here a detailed description of the model and report BELA's performance on four entity linking datasets covering high- and low-resource languages.
Computation and Language
What problem does this paper attempt to address?
### Problems the Paper Aims to Solve This paper aims to address several key issues in the task of Entity Linking (EL): 1. **Multilingual Coverage**: There is currently a lack of efficient, unified solutions to handle entity linking tasks across multiple languages, leading to the need for complex model stacks. This paper introduces BELA (Bi-encoder Entity Linking Architecture), the first model capable of detecting and linking entities in 97 languages in a unified manner. 2. **High Efficiency**: Entity linking systems in practical applications require not only high accuracy but also high speed and broad coverage. BELA employs a bi-encoder architecture, completing the entity linking of an entire text segment in a single forward pass, regardless of the number of entity mentions it contains. 3. **End-to-End Solution**: Existing multilingual entity linking systems are often not end-to-end. BELA combines advancements across multiple dimensions, including end-to-end but single-language architectures, Transformer-based architectures, and single-pass end-to-end architectures, and extends them further. By achieving these goals, BELA aims to simplify and improve entity-centric applications such as information extraction, content moderation, and question-answering systems.