Abstract:Recurrent Neural Networks (RNNs) and transformers are deep learning models that have achieved remarkable success in several Natural Language Processing (NLP) tasks since they do not rely on handcrafted features nor enormous knowledge resources. Named Entity Recognition (NER) is an essential NLP task that is used in many applications such as information retrieval, question answering, and machine translation. NER aims to locate, extract, and classify named entities into predefined categories such as person, organization and location. Arabic NER is considered a challenging task because of the complexity and the unique characteristics of Arabic. Most of the previous research on deep learning based-Arabic NER focused on Modern Standard Arabic and Dialectal Arabic, which are different variations from Classical Arabic. In this paper, we investigate deep learning-based Classical Arabic NER using different deep neural network architectures and a BERT based contextual language model that is trained on general domain Arabic text. We propose two RNN-based models by fine-tunning the pretrained BERT language model to recognize and classify named entities from Classical Arabic. The pre-trained BERT contextual language model representations were used as input features to a BGRU/BLSTM model and were fine-tuned using a Classical Arabic NER dataset. In addition, we explore variant architectures of the proposed BERT-BGRU/BLSTM-CRF models. Experimentations showed that the BERT-BGRU-CRF model outperformed the other models by achieving an F-measure of 94.76% on the CANERCorpus. To the best of our knowledge, this is the first work that aims to recognize named entities in Classical Arabic using deep learning.

Recognition and translation Arabic-French of Named Entities: case of the Sport places

Using LSTM and GRU With a New Dataset for Named Entity Recognition in the Arabic Language

Content-Localization based Neural Machine Translation for Informal Dialectal Arabic: Spanish/French to Levantine/Gulf Arabic

End-to-end named entity extraction from speech

Object Recognition System for the Visually Impaired: A Deep Learning Approach using Arabic Annotation

Can Multilingual Language Models Transfer to an Unseen Dialect? A Case Study on North African Arabizi

Classical Arabic Named Entity Recognition Using Variant Deep Neural Network Architectures and BERT

Impact of translation on biomedical information extraction from real-life clinical notes

Dealing with Metonymic Readings of Named Entities

Cross-Language Personal Name Mapping

Empirical Evaluation of Leveraging Named Entities for Arabic Sentiment Analysis

A Named Entity Recognition Approach for Albanian Using Deep Learning

A Sequence-to-Sequence Approach for Arabic Pronoun Resolution

English-Arabic Transliteration

Rule-Based Machine Translation from Tunisian Dialect to Modern Standard Arabic

Contribution au Niveau de l'Approche Indirecte à Base de Transfert dans la Traduction Automatique

A Benchmark Evaluation of Multilingual Large Language Models for Arabic Cross-Lingual Named-Entity Recognition

Neural Named Entity Recognition from Subword Units

A Survey on Arabic Named Entity Recognition: Past, Recent Advances, and Future Trends

Similarity-based model for transliteration

A Benchmark Evaluation of Clinical Named Entity Recognition in French