BEHRT: Transformer for Electronic Health Records

Yikuan Li,Shishir Rao,Jose Roberto Ayala Solares,Abdelaali Hassaine,Dexter Canoy,Yajie Zhu,Kazem Rahimi,Gholamreza Salimi-Khorshidi
DOI: https://doi.org/10.48550/arXiv.1907.09538
2019-07-23
Abstract:Today, despite decades of developments in medicine and the growing interest in precision healthcare, vast majority of diagnoses happen once patients begin to show noticeable signs of illness. Early indication and detection of diseases, however, can provide patients and carers with the chance of early intervention, better disease management, and efficient allocation of healthcare resources. The latest developments in machine learning (more specifically, deep learning) provides a great opportunity to address this unmet need. In this study, we introduce BEHRT: A deep neural sequence transduction model for EHR (electronic health records), capable of multitask prediction and disease trajectory mapping. When trained and evaluated on the data from nearly 1.6 million individuals, BEHRT shows a striking absolute improvement of 8.0-10.8%, in terms of Average Precision Score, compared to the existing state-of-the-art deep EHR models (in terms of average precision, when predicting for the onset of 301 conditions). In addition to its superior prediction power, BEHRT provides a personalised view of disease trajectories through its attention mechanism; its flexible architecture enables it to incorporate multiple heterogeneous concepts (e.g., diagnosis, medication, measurements, and more) to improve the accuracy of its predictions; and its (pre-)training results in disease and patient representations that can help us get a step closer to interpretable predictions.
Machine Learning
What problem does this paper attempt to address?
This paper aims to solve the problem of the accuracy of disease prediction in electronic health records (EHR). Specifically, the authors propose a deep neural sequence - to - sequence transformation model named BEHRT (BERT for EHR) for multi - task prediction of EHR data and disease trajectory mapping. By introducing the attention mechanism, BEHRT can provide a personalized view of disease trajectories, and its flexible architecture can integrate multiple heterogeneous concepts (such as diagnosis, drugs, measurements, etc.) to improve the accuracy of prediction. In addition, the disease and patient representations generated during the pre - training process of BEHRT contribute to achieving interpretable predictions. The main contributions of the paper are as follows: 1. **Improving prediction accuracy**: When predicting the onset of 301 diseases, BEHRT improves the average precision score by 8.0 - 10.8% compared to the existing state - of - the - art deep EHR models. 2. **Personalized disease trajectories**: Through the attention mechanism, BEHRT can provide a personalized view of disease trajectories, helping doctors and patients better understand the development process of diseases. 3. **Multimodal data processing**: The flexible architecture of BEHRT can integrate multiple heterogeneous concepts and improve the accuracy of prediction. 4. **Interpretability**: The disease and patient representations generated from the pre - training results contribute to achieving interpretable predictions, making the model more transparent and trustworthy. In conclusion, by proposing the BEHRT model, this paper aims to use deep - learning techniques to improve the accuracy of disease prediction in electronic health records and provide more personalized medical advice.