Perfect Voice Applications with Chirp and Speech Fine-Tuning

Overview

Explore the latest advancements in speech recognition technology with this 27-minute conference talk from Google Cloud Next. Dive into the capabilities of Vertex AI and Chirp, a 2 billion parameter foundation model, revolutionizing speech-to-text applications. Learn how to leverage large models for speech tasks and fine-tune them for specific use cases using in-domain data. Discover the journey from initial setup to perfecting voice models for your applications. Gain insights into the Speech API, Chirp, and Bell Canada's implementation of call listening. Understand the high-level architecture, top 3 use cases, and pitch effectiveness. Explore speech tuning techniques, their implementation, and the impressive results achieved. Perfect for developers and enterprises looking to enhance their voice applications and stay at the forefront of speech recognition technology.

Syllabus

Intro
Speech API
Chirp
Bell Canada
How call listening works
High level architecture
Top 3 use cases
Pitch Effectiveness
Next Steps
Speech Tuning
How we do it
How it works
Results

Taught by

Google Cloud Tech

Reviews

Start your review of Perfect Voice Applications with Chirp and Speech Fine-Tuning

Taught by

Power New Voice-Enabled Interfaces and Applications with Google Cloud's Speech Solutions

Natural Language Processing

10 Best Machine Learning Courses for 2024: Scikit-learn, TensorFlow, and more

Never Stop Learning.