Abstract:BACKGROUND:Medical event detection in narrative clinical notes of electronic health records (EHRs) is a task designed for reading text and extracting information. Most of the previous work of medical event detection treats the task as extracting concepts at word granularity, which omits the overall structural information of the clinical notes. In this work, we treat each clinical note as a sequence of short sentences and propose an end-to-end deep neural network framework.METHODS:We redefined the task as a sequence labelling task at short sentence granularity, and proposed a novel tag system correspondingly. The dataset were derived from a third-level grade-A hospital, consisting of 2000 annotated clinical notes according to our proposed tag system. The proposed end-to-end deep neural network framework consists of a feature extractor and a sequence labeller, and we explored different implementations respectively. We additionally proposed a smoothed Viterbi decoder as sequence labeller without additional parameter training, which can be a good alternative to conditional random field (CRF) when computing resources are limited.RESULTS:Our sequence labelling models were compared to four baselines which treat the task as text classification of short sentences. Experimental results showed that our approach significantly outperforms the baselines. The best result was obtained by using the convolutional neural networks (CNNs) feature extractor and the sequential CRF sequence labeller, achieving an accuracy of 92.6%. Our proposed smoothed Viterbi decoder achieved a comparable accuracy of 90.07% with reduced training parameters, and brought more balanced performance across all categories, which means better generalization ability.CONCLUSIONS:Evaluated on our annotated dataset, the comparison results demonstrated the effectiveness of our approach for medical event detection in Chinese clinical notes of EHRs. The best feature extractor is the CNNs feature extractor, and the best sequence labeller is the sequential CRF decoder. And it was empirically verified that our proposed smoothed Viterbi decoder could bring better generalization ability while achieving comparable performance to the sequential CRF decoder.

Deep Learning Based Information Extraction Framework on Chinese Electronic Health Records

Clinical Named Entity Recognition from Chinese Electronic Medical Records Based on Deep Learning Pretraining

A unified framework of medical information annotation and extraction for Chinese clinical text

Chinese Clinical Named Entity Recognition with Word-Level Information Incorporating Dictionaries

Named Entity Extraction for Chinese Electronic Medical Records.

An attention-based deep learning model for clinical named entity recognition of Chinese electronic medical records

Named Entity Recognition in Chinese Clinical Text Using Deep Neural Network.

Character-Based Deep Learning Approaches for Clinical Named Entity Recognition: A Comparative Study Using Chinese EHR Texts.

An Approach for Medical Event Detection in Chinese Clinical Notes of Electronic Health Records

Text Detection and Recognition for Images of Medical Laboratory Reports With a Deep Learning Approach

Named Entity Recognition in Chinese Electronic Medical Records Based on CRF.

Information Extraction of Chinese Medical Electronic Records Via Evolutionary Neural Architecture Search

Temporal Expression Recognition and Temporal Relationship Extraction from Chinese Narrative Medical Records

Using Deep Learning Based Natural Language Processing Techniques for Clinical Decision-Making with EHRs

A Hybrid Approach for Named Entity Recognition in Chinese Electronic Medical Record

A query interface for clinical research with Chinese electronic health record using Natural Language Processing

A self-attention based neural architecture for Chinese medical named entity recognition

An Automated Approach For Clinical Quantitative Information Extraction From Chinese Electronic Medical Records

A Neural Framework for Chinese Medical Named Entity Recognition.

Extracting Entities with Attributes in Clinical Text Via Joint Deep Learning.