Artificial Intelligence Based Data Governance for Chinese Electronic Health Record Analysis

Sen Lin, J. Zhong, X. Yi, Panpan Wang, Jian Wang, Z. Shao
DOI: https://doi.org/10.5121/IJDKP.2018.8303
2018-05-30
International Journal of Data Mining & Knowledge Management Process
Abstract: Electronic health record (EHR) analysis can yield valuable insights for improving the quality of human healthcare. However, data quality problems such as missing values, inconsistencies, and errors in the data set severely hinder building robust machine learning models for data analysis. In this paper, we develop a methodology of artificial intelligence (AI)-based data governance that predicts missing values, verifies whether existing values are correct, and determines what they should be when they are wrong. We demonstrate the performance of this methodology through a case study of patient gender prediction and verification. Experimental results show that a convolutional neural network (CNN), a deep learning algorithm, performs very well on the test set as measured by F1-score, and that it outperforms support vector machine (SVM) models with different vector representations for documents.
Medicine, Computer Science