Towards The Automatic Coding of Medical Transcripts to Improve Patient-Centered Communication

Gilchan Park,Julia Taylor Rayz,Cleveland G. Shields
DOI: https://doi.org/10.48550/arXiv.2109.10514
2021-09-22
Computation and Language
Abstract:This paper aims to provide an approach for automatic coding of physician-patient communication transcripts to improve patient-centered communication (PCC). PCC is a central part of high-quality health care. To improve PCC, dialogues between physicians and patients have been recorded and tagged with predefined codes. Trained human coders have manually coded the transcripts. Since it entails huge labor costs and poses possible human errors, automatic coding methods should be considered for efficiency and effectiveness. We adopted three machine learning algorithms (Na\"ive Bayes, Random Forest, and Support Vector Machine) to categorize lines in transcripts into corresponding codes. The result showed that there is evidence to distinguish the codes, and this is considered to be sufficient for training of human annotators.
What problem does this paper attempt to address?