Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

Understanding Audio Data for Deep Learning

Valerio Velardo - The Sound of AI via YouTube

Overview

Explore fundamental concepts of audio digital processing essential for deep learning applications in this 33-minute video. Delve into waveforms, pitch, loudness, Fourier transform, spectrograms, and MFCCs. Gain insights into sound metrics, short-time Fourier transform, preprocessing pipelines, and MFCC representations and applications. Access accompanying slides and additional resources for a comprehensive understanding of audio data analysis in the context of deep learning.

Syllabus

Intro
What is sound
Metrics
Fourier transform
Shorttime Fourier transform
Preprocessing pipeline
M FCCs
M FCC representation
M FCC applications
Preprocessing
Outro

Taught by

Valerio Velardo - The Sound of AI

Reviews

Start your review of Understanding Audio Data for Deep Learning

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.