Abstract:The aim of this work is to develop efficient named entity recognition from the given text that in turn improves the performance of the systems that use natural language processing (NLP). The performance of IoT-based devices such as Alexa and Cortana significantly depends upon an efficient NLP model. To increase the capability of the smart IoT devices in comprehending the natural language, named entity recognition (NER) tools play an important role in these devices. In general, the NER is a two-step process that initially the proper nouns are identified from text and then classify them into predefined categories of entities such as person, location, measure, organization and time. NER is often performed as a subtask while processing natural languages which increases the accuracy level of a NLP task. In this paper, we propose deep neural network architecture for named entity recognition for the resource-scarce language Hindi, based on convolutional neural network (CNN), bidirectional long short-term memory (Bi-LSTM) neural network and conditional random field (CRF). In the proposed approach, initially, we use skip-gram word2vec model and GloVe model to represent words in semantic vectors which are further used in different deep neural network-based architectures. In the proposed approach, we use character- and word-level embedding to represent the text that includes information at fine-grained level. Due to the use of character-level embeddings, the proposed model is robust for the out-of-vocabulary words. Experimental results show that the combination of Bi-LSTM, CNN and CRF algorithms performs better as compared to the other baseline methods such as recurrent neural network, long short-term memory and Bi-LSTM individually.

Using Data Augmentation and Bidirectional Encoder Representations from Transformers for Improving Punjabi Named Entity Recognition

Enriching Urdu NER with BERT Embedding, Data Augmentation, and Hybrid Encoder-CNN Architecture

Enhancing Low Resource NER Using Assisting Language And Transfer Learning

A deep neural network-based model for named entity recognition for Hindi language

Fine-tuning Pre-trained Named Entity Recognition Models For Indian Languages

A deep learning approach for Named Entity Recognition in Urdu language

A novel Data and Model Centric artificial intelligence based approach in developing high-performance Named Entity Recognition for Bengali Language

DATG: Data Augmentation with Transformer-Based Generation for Low-Resource Named Entity Recognition

Named Entity Recognition for English Language Using Deep Learning Based Bi Directional LSTM-RNN

Data Augmentation for Cross-Domain Named Entity Recognition

Mono vs Multilingual BERT: A Case Study in Hindi and Marathi Named Entity Recognition

Gazetteer-Enhanced Bangla Named Entity Recognition with BanglaBERT Semantic Embeddings K-Means-Infused CRF Model

Named Entity Recognition for Nepali Language

On Significance of Subword tokenization for Low Resource and Efficient Named Entity Recognition: A case study in Marathi

Improving the Quality of MT Output using Novel Name Entity Translation Scheme

In domain training data augmentation on noise robust Punjabi Children speech recognition

Cascaded Models for Better Fine-Grained Named Entity Recognition

Bridging the Gap: Transfer Learning from English PLMs to Malaysian English

An improved data augmentation approach and its application in medical named entity recognition

Towards Malay named entity recognition: an open-source dataset and a multi-task framework

Long Range Named Entity Recognition for Marathi Documents