Toward Automated Clinical Transcriptions

Mitchell A. Klusty, W. Vaiden Logan, Samuel E. Armstrong, Aaron D. Mullen, Caroline N. Leach, Jeff Talbert, V. K. Cody Bumgardner
2024-09-20
Abstract: Administrative documentation is a major driver of rising healthcare costs and is linked to adverse outcomes, including physician burnout and diminished quality of care. This paper introduces a secure system that applies recent advancements in speech-to-text transcription and speaker-labeling (diarization) to patient-provider conversations. This system is optimized to produce accurate transcriptions and highlight potential errors to promote rapid human verification, further reducing the necessary manual effort. Applied to over 40 hours of simulated conversations, this system offers a promising foundation for automating clinical transcriptions.
Audio and Speech Processing, Artificial Intelligence, Computation and Language, Sound
What problem does this paper attempt to address?
The paper addresses the automation of clinical documentation. It introduces a secure system that applies recent speech-to-text and speaker diarization technology to recordings of conversations between patients and providers. The main objectives of the system are:

1. **Improving transcription accuracy**: The system generates accurate transcriptions with speech recognition models (such as OpenAI's Whisper) and distinguishes speakers with diarization tools (such as PyAnnote), which reduces the manual verification workload.
2. **Simplifying manual verification**: The system highlights potential errors so that a human reviewer can quickly verify the final document.
3. **Ensuring data security**: The system uses security measures such as end-to-end encryption to protect sensitive medical information, and all processing runs on local servers to reduce the risk of data breaches.
4. **Reducing the burden on healthcare professionals**: By automating most documentation tasks, the system lets doctors spend more time on direct patient care and less on administrative paperwork.

In summary, the paper proposes a system that integrates recent AI technologies to transcribe clinical conversations efficiently, securely, and accurately, with the goal of improving the quality and efficiency of medical services.
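
The transcription, diarization, and error-highlighting steps described above could be combined roughly as in the sketch below. It is an illustration under stated assumptions, not the authors' implementation: it does not call Whisper or PyAnnote, and instead assumes their outputs have already been converted into the hypothetical `Word` and `Turn` records. The max-overlap speaker assignment, the `0.6` confidence threshold, and the `[?...?]` marking are all illustrative choices that do not come from the paper.

```python
from dataclasses import dataclass


@dataclass
class Word:
    """Hypothetical record for one transcribed word."""
    text: str
    start: float       # seconds
    end: float         # seconds
    confidence: float  # assumed per-word score in [0.0, 1.0]


@dataclass
class Turn:
    """Hypothetical record for one diarized speaker turn."""
    speaker: str
    start: float  # seconds
    end: float    # seconds


def overlap(a_start, a_end, b_start, b_end):
    """Length in seconds of the intersection of two time intervals."""
    return max(0.0, min(a_end, b_end) - max(a_start, b_start))


def label_words(words, turns, threshold=0.6):
    """Attach a speaker and a review flag to each word.

    The speaker is taken from the turn with the largest time overlap;
    a word that overlaps no turn is labeled "UNKNOWN". A word is flagged
    for review when its confidence is below `threshold` (illustrative).
    """
    labeled = []
    for w in words:
        best = max(
            turns,
            key=lambda t: overlap(w.start, w.end, t.start, t.end),
            default=None,
        )
        if best is None or overlap(w.start, w.end, best.start, best.end) == 0.0:
            speaker = "UNKNOWN"
        else:
            speaker = best.speaker
        labeled.append((speaker, w.text, w.confidence < threshold))
    return labeled


def render(labeled):
    """Group consecutive words by speaker; wrap flagged words in [?...?]."""
    lines = []
    current = None
    for speaker, text, flagged in labeled:
        token = f"[?{text}?]" if flagged else text
        if speaker != current:
            lines.append(f"{speaker}: {token}")
            current = speaker
        else:
            lines[-1] += " " + token
    return "\n".join(lines)


if __name__ == "__main__":
    # Made-up example data, not taken from the paper's recordings.
    words = [
        Word("How", 0.0, 0.3, 0.95),
        Word("are", 0.3, 0.5, 0.90),
        Word("you", 0.5, 0.8, 0.92),
        Word("Dizzy", 1.0, 1.4, 0.41),
        Word("today", 1.4, 1.8, 0.88),
    ]
    turns = [Turn("PROVIDER", 0.0, 0.9), Turn("PATIENT", 0.9, 2.0)]
    print(render(label_words(words, turns)))
```

Flagging at the word level keeps the reviewer's attention on the spans most likely to be wrong, which is the kind of rapid human verification the abstract describes; a real system would need to calibrate the threshold against verified transcripts.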