Overview
Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Learn how to leverage OpenAI's Whisper, a powerful speech-to-text AI model, to automatically generate accurate subtitles for YouTube videos in multiple languages. Explore a Python-based solution that downloads YouTube video audio, transcribes content using transformer architecture, and creates SRT subtitle files ready for upload. Discover the process of fine-tuning Whisper for languages not included in the base model through a provided Jupyter Notebook. Master the implementation of this open-source tool, released in September 2022, to enhance video accessibility and reach a global audience with professionally transcribed subtitles. Access comprehensive resources including scientific documentation, HuggingFace model implementations, and detailed fine-tuning guides to optimize the speech-to-text conversion process for your specific needs.
Syllabus
Whisper for YouTube: Speech2TextAI: Perfect YouTube Subtitles - Free. Multiple Languages.
Taught by
Discover AI