Reverb: Open-Source ASR and Diarization from Rev

Nishchal Bhandari, Danny Chen, Miguel Ángel del Río Fernández, Natalie Delworth, Jennifer Drexler Fox, Migüel Jetté, Quinten McNamara, Corey Miller, Ondřej Novotný, Ján Profant, Nan Qin, Martin Ratajczak, Jean-Philippe Robichaud
2024-10-05
Abstract: Today, we are open-sourcing our core speech recognition and diarization models for non-commercial use. We are releasing both a full production pipeline for developers and pared-down research models for experimentation. Rev hopes that these releases will spur research and innovation in the fast-moving domain of voice technology. The speech recognition models released today outperform all existing open-source speech recognition models across a variety of long-form speech recognition domains.
Computation and Language, Sound, Audio and Speech Processing