Open source platform for Estonian speech transcription

Olev, Aivo; Alumäe, Tanel
DOI: https://doi.org/10.1007/s10579-024-09777-1
2024-10-17
Language Resources and Evaluation
Abstract: This paper presents our progress in developing and maintaining a public speech and speaker recognition platform for the Estonian language. The platform consists of a speech processing pipeline and a web-based user interface for end-users, offering transcript post-editing functionality. It is offered for free as a public service and is in active use. The service provides significantly higher speech recognition accuracy than commercial alternatives. We discuss the switch to a workflow management system and how it has improved the core speech processing pipeline. The core systems behind the platform have been made available as open-source code and deployed internally by multiple public and private institutions.
computer science, interdisciplinary applications