Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

Real-Time Live Speech-to-Text - Streaming ASR Gradio App with Hugging Face Tutorial

1littlecoder via YouTube

Overview

Build a real-time automatic speech recognition system using Facebook's Wav2Vec2 deep learning model in this applied NLP tutorial. Learn to implement Hugging Face Transformers Pipeline for audio-to-text conversion and create a Python web app with Gradio for live audio transcription. Explore pipeline setup, UI interface components, and state management. Access the provided Colab notebook for hands-on practice and discover related resources, including a guide on deploying Gradio ML apps on Hugging Face Spaces and a detailed blog post on real-time speech recognition. Enhance your NLP skills with additional tutorials, such as YouTube video transcript summarization using Hugging Face Transformers.

Syllabus

Introduction
Pipeline
UI
Interface Components
State

Taught by

1littlecoder

Reviews

Start your review of Real-Time Live Speech-to-Text - Streaming ASR Gradio App with Hugging Face Tutorial

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.