High Fidelity Neural Audio Compression - Paper & Code Explained

Aleksa Gordić - The AI Epiphany via YouTube

Overview

Watch a detailed video explanation of the "High Fidelity Neural Audio Compression" paper and its accompanying code. The method is presented as compressing audio roughly 10x more than MP3 at comparable quality, at a bitrate of just 6 kbps. Learn about related concepts such as VQ-VAE, VQ-GAN, and AudioGen in the context of audio compression. Follow a paper walk-through and a code walk-through that cover key components such as Residual Vector Quantization, the EnCodec architecture, and efficient bit packing. The video also considers the potential impact on internet traffic and the future of audio streaming.
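
For readers new to Residual Vector Quantization, here is a minimal toy sketch of the idea: each stage picks the nearest entry in its own codebook and passes the leftover error (the residual) to the next stage. This is not the EnCodec implementation or the code shown in the video. The function names and the tiny hand-written codebooks are hypothetical. The bitrate figures in the last lines (75 frames per second, 8 codebooks of 1024 entries) are assumed values that reproduce the 6 kbps mentioned above; they are not taken from this page.

```python
def nearest(codebook, vec):
    """Index of the codebook entry closest to vec (squared Euclidean distance)."""
    return min(
        range(len(codebook)),
        key=lambda i: sum((c - v) ** 2 for c, v in zip(codebook[i], vec)),
    )

def rvq_encode(vec, codebooks):
    """Quantize vec stage by stage; each stage codes the residual left so far."""
    residual = list(vec)
    indices = []
    for codebook in codebooks:
        i = nearest(codebook, residual)
        indices.append(i)
        residual = [r - c for r, c in zip(residual, codebook[i])]
    return indices

def rvq_decode(indices, codebooks):
    """Reconstruct by summing the selected entry from every stage."""
    out = [0.0] * len(codebooks[0][0])
    for i, codebook in zip(indices, codebooks):
        out = [o + c for o, c in zip(out, codebook[i])]
    return out

def bitrate_bps(frames_per_second, n_codebooks, codebook_size):
    """Bits per second when every frame stores one index per codebook."""
    bits_per_index = (codebook_size - 1).bit_length()
    return frames_per_second * n_codebooks * bits_per_index

# Hypothetical 2-D codebooks: a coarse stage followed by a fine stage.
coarse = [[0.0, 0.0], [1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
fine = [[0.0, 0.0], [0.25, 0.0], [0.0, 0.25], [-0.25, 0.0], [0.0, -0.25]]

codes = rvq_encode([0.9, 0.2], [coarse, fine])
approx = rvq_decode(codes, [coarse, fine])

# Assumed configuration: 75 frames/s, 8 codebooks, 1024 entries (10 bits) each.
rate = bitrate_bps(75, 8, 1024)
```

Adding more stages refines the reconstruction and raises the bitrate proportionally, which is why a single set of codebooks can serve several target bitrates by using only the first few stages.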

Syllabus

Intro
Paper walk-through: high level overview
Residual Vector Quantization
Reducing the bandwidth using arithmetic coding and transformers
Loss formulations and results
Code walk-through
EnCodec architecture
Residual Vector Quantizer module
Loading the audio signal
Compression - a forward pass through the encoder
Quantization forward pass
Efficiently packing the bits
Using a language model to further compress audio
Outro
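
As a companion to the "Efficiently packing the bits" chapter, the sketch below shows one common way to store fixed-width codebook indices without wasting bits: 10-bit indices are written back to back into a byte stream instead of taking 16 bits each. It is an illustration under assumptions, not the packing code from the video or the EnCodec repository. The names `pack_bits` and `unpack_bits` and the least-significant-bit-first layout are choices made here.

```python
def pack_bits(values, bits):
    """Pack non-negative integers, `bits` bits each, into bytes (LSB first)."""
    acc = 0      # accumulator holding bits not yet written out
    n_acc = 0    # number of valid bits currently in acc
    out = bytearray()
    for v in values:
        if not 0 <= v < (1 << bits):
            raise ValueError(f"{v} does not fit in {bits} bits")
        acc |= v << n_acc
        n_acc += bits
        while n_acc >= 8:          # flush every complete byte
            out.append(acc & 0xFF)
            acc >>= 8
            n_acc -= 8
    if n_acc:                      # flush the final partial byte, zero padded
        out.append(acc & 0xFF)
    return bytes(out)

def unpack_bits(data, bits, count):
    """Inverse of pack_bits: read `count` integers of `bits` bits each."""
    acc = 0
    n_acc = 0
    mask = (1 << bits) - 1
    values = []
    for byte in data:
        acc |= byte << n_acc
        n_acc += 8
        while n_acc >= bits and len(values) < count:
            values.append(acc & mask)
            acc >>= bits
            n_acc -= bits
    return values

# Four hypothetical 10-bit indices occupy 40 bits, i.e. 5 bytes instead of 8.
indices = [1023, 0, 512, 7]
packed = pack_bits(indices, 10)
restored = unpack_bits(packed, 10, len(indices))
```

The decoder needs to know the index count (or the stream length) separately, because the last byte may contain padding bits.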

Taught by

Aleksa Gordić - The AI Epiphany
