Abstract:Discovery of biomedical named entities is one of the preliminary steps for many biomedical texts mining task. In the biomedical domain, typical entities are present, including disease, chemical, gene, and protein. To find these entities, currently, a deep learning-based approach applied into the Biomedical Named Entity Recognition (Bio_NER) which gives prominent results. Although deep learning-based approach gives a satisfactory result, still a tremendous amount of data is required for training because a lack of data can be one of the barriers in the performance of Bio_NER. There is one more obstacle in the path of Bio_NER is polysemy or misclassification of the entity in bio-entity. Which means one biomedical entity might have a different meaning in different places, i.e., a gene named entity may be labeled as disease name. When Conditional Random Field combined with deep learning-based approach i.e. Bidirectional Long Short Term Memory (Bi-LSTM), It mistakenly labeled a gene entity “BRCA1” as a disease entity which is “BRCA1 abnormality” or “Braca1-deficient” present in the training dataset. Similarly, “VHL (Von Hippel-Lindau disease),” which is one of the genes named labeled as a disease by Bi-LSTM CRF Model. One more problem is addressed in this chapter, as bio-med domain, entities are long and complex like cell whose name is “A375M (B-Raf (V600E)) is a human melanoma cell line”, in this biomedical entity, multiple words are present, but still it is difficult to find the context information of this particular bio-entity. For lack of data and entity misclassification problem, this chapter embeds multiple Bio_NER models. In the proposed model, the model trained with different datasets is connected so that the targeted model obtained the information by combining another model, which reduce the false-positives rate. Recurrent Neural Network (RNN) which is dependent upon the Bi-LSTM gates are introduced to handle the long and complex range dependencies in biomedical entities. BioCreative II GM Corpus, Pubmed, Gold-standard dataset, and JNLPBA dataset are used in this research work.

Semi-supervised deep learning based named entity recognition model to parse education section of resumes

Named entity recognition in resumes

Named Entity Recognition based Resume Parser and Summarizer

Semi-supervised Bootstrapping approach for Named Entity Recognition

Named Entity Recognition for English Language Using Deep Learning Based Bi Directional LSTM-RNN

Named entity recognition based on semi-supervised ensemble learning with the improved tri-training algorithm

Machine Learned Resume-Job Matching Solution

Resume Information Extraction via Post-OCR Text Processing

AI Resume Analyzer Using Natural Language Processing and Data Mining

Advancements in Named Entity Recognition using Deep Learning Techniques: A Comprehensive Study on Emerging Trends

Language model based on deep learning network for biomedical named entity recognition

ResumeAtlas: Revisiting Resume Classification with Large-Scale Datasets and Large Language Models

A Machine Learning and NLP Approach for Analyzing Eligibility Based on Resume and CV

Reinforcement learning based distantly supervised biomedical named entity recognition

Online biomedical named entities recognition by data and knowledge-driven model

Towards Bootstrapping Biomedical Named Entity Recognition Using Reinforcement Learning

DistALANER: Distantly Supervised Active Learning Augmented Named Entity Recognition in the Open Source Software Ecosystem

Disambiguation Model for Bio-Medical Named Entity Recognition

A domain adaptation approach for resume classification using graph attention networks and natural language processing

Resume Validation and Filtration using Natural Language Processing

RESUME RECOMMENDATION USING MACHINE LEARNING