Application of speech rate conversion technology to video editing : Allows up to 5 times normal speed playback while maintaining speech intelligibility

T. Takagi,T. Mishima,E. Miyasaka,Seiyama Nobumasa,Imai Atsushi
2001-10-01
Abstract:This paper describes an application of speech rate conversion technology to video editing. In video editing, it is common to search through the material at several times normal speed. The speech rate conversion technology maintains the original pitch and timbre of speech despite playing it back at a faster rate, which is varied adaptively to permit fast listening in real-time. In listening tests, users were able to comprehend speech played at up to 5 times normal speed which was incomprehensible without the adaptive rate conversion (even when pitch-shifted to restore the original pitch). A prototype editing system, which maintains the intelligibility of speech during variable-rate playback, has been applied to a non-linear editing system. The system can change the replay speed of MPEG1 or 2 audio and video simultaneously has been implemented on a personal computer.
Computer Science
What problem does this paper attempt to address?