Abstract:Background: Negation and speculation unrelated to abnormal findings can lead to false-positive alarms for automatic radiology report highlighting or flagging by laboratory information systems. Objective: This internal validation study evaluated the performance of natural language processing methods (NegEx, NegBio, NegBERT, and transformers). Methods: We annotated all negative and speculative statements unrelated to abnormal findings in reports. In experiment 1, we fine-tuned several transformer models (ALBERT [A Lite Bidirectional Encoder Representations from Transformers], BERT [Bidirectional Encoder Representations from Transformers], DeBERTa [Decoding-Enhanced BERT With Disentangled Attention], DistilBERT [Distilled version of BERT], ELECTRA [Efficiently Learning an Encoder That Classifies Token Replacements Accurately], ERNIE [Enhanced Representation through Knowledge Integration], RoBERTa [Robustly Optimized BERT Pretraining Approach], SpanBERT, and XLNet) and compared their performance using precision, recall, accuracy, and F1-scores. In experiment 2, we compared the best model from experiment 1 with 3 established negation and speculation-detection algorithms (NegEx, NegBio, and NegBERT). Results: Our study collected 6000 radiology reports from 3 branches of the Chi Mei Hospital, covering multiple imaging modalities and body parts. A total of 15.01% (105,755/704,512) of words and 39.45% (4529/11,480) of important diagnostic keywords occurred in negative or speculative statements unrelated to abnormal findings. In experiment 1, all models achieved an accuracy of >0.98 and F1-score of >0.90 on the test data set. ALBERT exhibited the best performance (accuracy=0.991; F1-score=0.958). In experiment 2, ALBERT outperformed the optimized NegEx, NegBio, and NegBERT methods in terms of overall performance (accuracy=0.996; F1-score=0.991), in the prediction of whether diagnostic keywords occur in speculative statements unrelated to abnormal findings, and in the improvement of the performance of keyword extraction (accuracy=0.996; F1-score=0.997). Conclusions: The ALBERT deep learning method showed the best performance. Our results represent a significant advancement in the clinical applications of computer-aided notification systems.

Scope resolution of predicted negation cues: A two-step neural network-based approach

A Syntactic Path-Based Hybrid Neural Network for Negation Scope Detection

Negation Scope Detection with Recurrent Neural Networks Models in Review Texts.

Negation scope detection with a conditional random field model

Speculation and negation identification via unified Machine Reading Comprehension frameworks with lexical and syntactic data augmentation

Negation Scope Resolution for Chinese As a Second Language

ScoNe: Benchmarking Negation Reasoning in Language Models With Fine-Tuning and In-Context Learning

Resolving Legalese: A Multilingual Exploration of Negation Scope Resolution in Legal Documents

Deep Learning Approach for Negation and Speculation Detection for Automated Important Finding Flagging and Extraction in Radiology Report: Internal Validation and Technique Comparison Study

Learn to Understand Negation in Video Retrieval

A Neural Network Approach to Verb Phrase Ellipsis Resolution

Revisiting Negation in Neural Machine Translation

ABINet++: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Spotting

Negation-based transfer learning for improving biomedical Named Entity Recognition and Relation Extraction

NevIR: Negation in Neural Information Retrieval

Read Like Humans: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Recognition

Binary and Multitask Classification Model for Dutch Anaphora Resolution: Die/Dat Prediction

Negation detection in Dutch clinical texts: an evaluation of rule-based and machine learning methods

NegatER: Unsupervised Discovery of Negatives in Commonsense Knowledge Bases

Using Structured Representation and Data: A Hybrid Model for Negation and Sentiment in Customer Service Conversations

Probing structural constraints of negation in Pretrained Language Models