Visual Features for Context-Aware Speech Recognition - 2016
Center for Language & Speech Processing(CLSP), JHU via YouTube
Overview
Syllabus
Intro
Outline
Automatic Speech Recognition
Speech Variability (Spectral)
Decoding Procedure
Experimental Setup
Simple Extensions
Performance on Switchboard
IARPA "Aladdin" Project
Speaker Microphone Distance (SMD)
Training SMD Extractors
Training SMD descriptors
SMD Results
SMD Analysis
Audio-Visual ASR
Speaker Attributes
Speaker Actions
Semantic Indexing CNN Features
Fusion of Approaches
Analysis "indoor" vs "outdoor"
Summary
Taught by
Center for Language & Speech Processing(CLSP), JHU