Overview
Syllabus
Intro
Why Care About Low-Resource Speech Processing?
How Much Transcribed Audio Do We Need?
Why Do We Need All That Training Data?
Multilingual Features
The IARPA Babel Program
Babel Languages
Limited resources
What is keyword search, and why focus on it?
How do we measure keyword search performance?
Properties of term-weighted value
Take-Home Messages
Three Ways of Looking at Speech
Deep Neural Network
A Stacked DNN Architecture
Convolutional Neural Network
Considered 2 CNN Architectures
Recurrent Neural Network
Bidirectional LSTM Architecture
Three Use Cases
More Expressive Architectures Make a Big Difference
Fixed Features Allow for Rapid Development
Our partners
Babel resources
Taught by
MITCBMM