Multilingual Video Viewing Subtitle to Audio Translator

Namya Sushil
DOI: https://doi.org/10.22214/ijraset.2024.61621
2024-05-31
International Journal for Research in Applied Science and Engineering Technology
Abstract: The research outlines an innovative approach to translate and synthesize audio content from English to any language, enhancing accessibility and knowledge acquisition for diverse linguistic communities. It focuses on improving language comprehension among various groups in India, particularly aiding visually impaired individuals. The approach incorporates Hugging Face Transformers for more precise language translation alongside Google Text-to-Speech for audio conversion. Additionally, Pydub and FFmpeg commands efficiently handle audio processing tasks, while Pytube facilitates YouTube video downloads. The system includes a subtitle generation feature that synchronizes subtitles with translated audio chunks, offering a comprehensive multilingual viewing experience. This implementation emphasizes language translation, audio manipulation, video processing, and subtitle generation functionalities, all integrated with robust error handling and multithreading capabilities. This showcases significant progress in accessible and multilingual content delivery, reaffirming our commitment to inclusive knowledge dissemination and communication.
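The abstract does not say how subtitles are aligned with the translated audio chunks. As an illustration only, one plausible way to do this is to give each subtitle the time span of its synthesized audio chunk and emit the result in SubRip (SRT) format. The sketch below assumes the chunks play back to back with no gaps and that each chunk's duration in milliseconds is already known (for example, from `len()` of a Pydub `AudioSegment`). The names `Chunk`, `format_timestamp`, and `build_srt` are hypothetical and do not come from the paper.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Chunk:
    """Hypothetical container for one translated segment."""
    text: str          # translated subtitle text
    duration_ms: int   # length of the synthesized audio, in milliseconds


def format_timestamp(ms: int) -> str:
    """Format milliseconds as an SRT timestamp (HH:MM:SS,mmm)."""
    hours, rem = divmod(ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    seconds, millis = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{seconds:02d},{millis:03d}"


def build_srt(chunks: List[Chunk]) -> str:
    """Build SRT text, assuming chunks are played consecutively without gaps."""
    entries = []
    start = 0
    for index, chunk in enumerate(chunks, start=1):
        end = start + chunk.duration_ms
        entries.append(
            f"{index}\n"
            f"{format_timestamp(start)} --> {format_timestamp(end)}\n"
            f"{chunk.text}\n"
        )
        start = end  # the next subtitle begins where this audio chunk ends
    # A blank line separates consecutive SRT entries.
    return "\n".join(entries)
```

Calling `build_srt([Chunk("Hello", 1500), Chunk("World", 2250)])` would yield two entries whose time ranges follow the cumulative audio durations. A real pipeline would also need to handle silence between chunks and any drift relative to the original video, which this sketch ignores.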