High Fidelity Neural Audio Compression - Paper & Code Explained
Aleksa Gordić - The AI Epiphany via YouTube
Syllabus
Intro
Paper walk-through: high level overview
Residual Vector Quantization
Reducing the bandwidth using arithmetic coding and transformers
Loss formulations and results
Code walk-through
EnCodec architecture
Residual Vector Quantizer module
Loading the audio signal
Compression - a forward pass through the encoder
Quantization forward pass
Efficiently packing the bits
Using LM to further compress audio
Outro
Taught by
Aleksa Gordić - The AI Epiphany