Extracting Speaker and Emotion Information from Self-Supervised Speech Models

Overview

Explore the extraction of speaker and emotion information from self-supervised speech models in this 38-minute conference talk by Themos Stafylakis from the Center for Language & Speech Processing (CLSP) at Johns Hopkins University. Delivered as part of JSALT 2023, the 30th edition of the workshop held in Le Mans, France, this presentation delves into cutting-edge techniques for analyzing speech data. Learn about the latest advancements in self-supervised learning applied to speech processing, with a focus on extracting valuable information about speakers and their emotional states. Gain insights into the research conducted at CLSP and its potential applications in various fields of speech technology and natural language processing.

Syllabus

Extracting speaker and emotion information from self-supervised speech models -- Themos Stafylakis

Taught by

Center for Language & Speech Processing(CLSP), JHU

Reviews

Start your review of Extracting Speaker and Emotion Information from Self-Supervised Speech Models

Taught by

Finite State Methods with Modern Neural Architectures for Speech Applications and Beyond

Explainability for Diarization - JSALT 2023 Team Presentation

Neural Conversational AI - JSALT 2023 Team Presentation

Weighted Finite State Automata and Linear Algebra - JSALT 2023 Team Presentation

Representation and Metric Learning Advances for Face and Speaker Biometric Systems

Text and Context in Natural Language Processing - JSALT 2023 Team Presentation

10 Best Machine Learning Courses for 2024: Scikit-learn, TensorFlow, and more

Never Stop Learning.