Overview
Explore OpenAI's Whisper, a groundbreaking speech-to-text model capable of transcribing speech in 97 languages and translating it into English. Learn about its weakly supervised encoder-decoder transformer architecture, trained on 680,000 hours of audio. Discover the model's implementation, fine-tuning process, and multitask capabilities. Delve into topics such as data quality, pipeline structure, generalization, overfitting prevention, and the impact of model size on performance. Gain insights into the weakly supervised training approach and how mixing tasks contributes to the model's versatility.
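For readers who want to try the model before watching, here is a minimal sketch using the open-source openai-whisper Python package; the checkpoint name ("base") and the audio file path are illustrative placeholders, not values taken from the course.

# Minimal sketch with the openai-whisper package (pip install openai-whisper).
# "base" and "audio.mp3" are assumptions for illustration only.
import whisper

# Load a pretrained checkpoint (tiny / base / small / medium / large).
model = whisper.load_model("base")

# Transcribe in the spoken language (auto-detected by default).
result = model.transcribe("audio.mp3")
print(result["text"])

# The same call with task="translate" produces an English translation instead.
translated = model.transcribe("audio.mp3", task="translate")
print(translated["text"])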
Syllabus
Intro
What is Whisper
Example Implementation
Weakly Supervised
Fine-tuning
Mixing Tasks
Data Quality
Model
Pipeline
Generalization
Overfitting
Model size
Multitask performance
Taught by
sentdex