Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

Preparing the Speech Dataset

Valerio Velardo - The Sound of AI via YouTube

Start learning Write review

Details

Start learning

Provider

YouTube
Pricing

Free Video
Languages

English
Duration & workload

37 minutes
Sessions

On-Demand

Found in

Audio Processing Courses

Overview

Learn how to pre-process a voice dataset by extracting Mel-frequency cepstral coefficients (MFCCs) and saving them in a JSON file in this 37-minute tutorial video. Explore the Speech Commands Dataset and follow along with the provided code to prepare your audio data for deep learning applications. Gain insights into dataset overview, prerequisites, data dictionary creation, and efficient storage techniques. Perfect for those interested in audio processing and machine learning for speech recognition tasks.

Syllabus

Introduction
Speech Dataset
Dataset Overview
Preparing the Dataset
Prerequisites
Data Dictionary
Loop Free
Magic
Labels
Storage
Review
Store
Outro

Taught by

Valerio Velardo - The Sound of AI

Reviews

Start your review of Preparing the Speech Dataset

Start learning

Taught by

Music Genre Classification - Preparing the Dataset

Mel-Frequency Cepstral Coefficients Explained Easily

Never Stop Learning.