Abstract:Background: Named entity recognition (NER) models are essential for extracting structured information from unstructured medical texts by identifying entities such as diseases, treatments, and conditions, enhancing clinical decision-making and research. Innovations in machine learning, particularly those involving Bidirectional Encoder Representations From Transformers (BERT)–based deep learning and large language models, have significantly advanced NER capabilities. However, their performance varies across medical datasets due to the complexity and diversity of medical terminology. Previous studies have often focused on overall performance, neglecting specific challenges in medical contexts and the impact of macrofactors like lexical composition on prediction accuracy. These gaps hinder the development of optimized NER models for medical applications. Objective: This study aims to meticulously evaluate the performance of various NER models in the context of medical text analysis, focusing on how complex medical terminology affects entity recognition accuracy. Additionally, we explored the influence of macrofactors on model performance, seeking to provide insights for refining NER models and enhancing their reliability for medical applications. Methods: This study comprehensively evaluated 7 NER models—hidden Markov models, conditional random fields, BERT for Biomedical Text Mining, Big Transformer Models for Efficient Long-Sequence Attention, Decoding-enhanced BERT with Disentangled Attention, Robustly Optimized BERT Pretraining Approach, and Gemma—across 3 medical datasets: Revised Joint Workshop on Natural Language Processing in Biomedicine and its Applications (JNLPBA), BioCreative V CDR, and Anatomical Entity Mention (AnatEM). The evaluation focused on prediction accuracy, resource use (eg, central processing unit and graphics processing unit use), and the impact of fine-tuning hyperparameters. The macrofactors affecting model performance were also screened using the multilevel factor elimination algorithm. Results: The fine-tuned BERT for Biomedical Text Mining, with balanced resource use, generally achieved the highest prediction accuracy across the Revised JNLPBA and AnatEM datasets, with microaverage (AVG_MICRO) scores of 0.932 and 0.8494, respectively, highlighting its superior proficiency in identifying medical entities. Gemma, fine-tuned using the low-rank adaptation technique, achieved the highest accuracy on the BioCreative V CDR dataset with an AVG_MICRO score of 0.9962 but exhibited variability across the other datasets (AVG_MICRO scores of 0.9088 on the Revised JNLPBA and 0.8029 on AnatEM), indicating a need for further optimization. In addition, our analysis revealed that 2 macrofactors, entity phrase length and the number of entity words in each entity phrase, significantly influenced model performance. Conclusions: This study highlights the essential role of NER models in medical informatics, emphasizing the imperative for model optimization via precise data targeting and fine-tuning. The insights from this study will notably improve clinical decision-making and facilitate the creation of more sophisticated and effective medical NER models.

Accurate Name Entity Recognition for Biomedical Literatures: A Combined High-quality Manual Annotation and Deep-learning Natural Language Processing Study

A Combined Manual Annotation and Deep-Learning Natural Language Processing Study on Accurate Entity Extraction in Hereditary Disease Related Biomedical Literature

Research on Named Entity Recognition from Biomedical Literature

Advancing entity recognition in biomedicine via instruction tuning of large language models

Biomedical named entity recognition using BERT in the machine reading comprehension framework

Partial Annotation Learning for Biomedical Entity Recognition

Named Entity Recognition in Chinese Medical Literature Using Pretraining Models

Language model based on deep learning network for biomedical named entity recognition

Evaluating Medical Entity Recognition in Health Care: Entity Model Quantitative Study

A BIGRU-Based Stacked Attention Network for Biomedical Named Entity Recognition with Chinese EMRs

BioALBERT: A Simple and Effective Pre-trained Language Model for Biomedical Named Entity Recognition

Recognizing Names in Biomedical Texts: a Machine Learning Approach

Improving dictionary-based named entity recognition with deep learning

Improving the recall of biomedical named entity recognition with label re-correction and knowledge distillation

Development of Biomedical Corpus Enlargement Platform Using BERT for Bio-entity Recognition

A comparative study for biomedical named entity recognition

Named Entity Recognition from Biomedical Texts Using a Fusion Attention-Based BiLSTM-CRF.

A hybrid deep-learning approach for complex biochemical named entity recognition

BioMNER: A Dataset for Biomedical Method Entity Recognition

BERT-Based Models with Attention Mechanism and Lambda Layer for Biomedical Named Entity Recognition

Accurate Medical Named Entity Recognition Through Specialized NLP Models