Completed
Spatial Distribution of Speech Clusters
Class Central Classrooms beta
YouTube videos curated by Class Central.
Classroom Contents
Unsupervised Learning of Spoken Language with Visual Context
Automatically move to the next video in the Classroom when playback concludes
- 1 Intro
- 2 Challenge for Automatic Speech Recognition
- 3 A Perspective on Spoken Language Processing Most (-9%) of the worlds languages have not been addressed by resource and expert intensive supervised
- 4 Crossing the Vision Language Boundary
- 5 Learning an Audio/Visual Embedding Space?
- 6 Joint Audio-Visual Analysis Architecture
- 7 Crowdsourcing Audio-Visual Data
- 8 Evaluation: Image and Search Annotation
- 9 Evaluating via Image Search
- 10 Evaluating via Image Annotation
- 11 Time-varying Audio-Visual Affiliation
- 12 Audio-Visual Grounding for Localization
- 13 Examples of Audio-Visual Clusters
- 14 Cluster Analysis
- 15 Spatial Distribution of Speech Clusters
- 16 Final Message