Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

OpenAI Embeddings with Voice Cloning - Eleven Labs API, ChatGPT API, Whisper API

Part Time Larry via YouTube

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Learn how to build a sophisticated question-answering voice assistant with realistic voice responses in this comprehensive tutorial. Explore the integration of OpenAI Embeddings, ChatGPT API, Whisper API, and Eleven Labs API to create a powerful AI-driven assistant. Discover techniques for voice cloning, natural language processing, and user interface development using Gradio. Follow along as the instructor demonstrates how to construct a Q&A corpus, implement vector embeddings, and utilize cosine similarity for accurate answer retrieval. Gain insights into incorporating AI-generated avatars, handling microphone input, and optimizing voice synthesis settings. By the end of this tutorial, you'll have the knowledge to create your own advanced voice assistant with customizable voices and intelligent responses.

Syllabus

Project Description: Q&A + Voice Cloning
The movie “Her” and the Idea of Smarter Assistants
Voice Sampling
Demo Voice #1 Samantha Voice
Demo Voice #2 Jay-Z Voice, Rhyming Responses
Hip Hop Music and Sampling Analogy
Hip Hop Production, Rick Rubin, Taste and Technical Ability Clip
Recap of OpenAI For Finance Series So Far, Prerequisites
Building a Q&A Corpus, Vector Embeddings, Cosine Similarity Review
Building a User Interface with Gradio, Starter Code from Video #9
Voice Cloning with Eleven Labs API
Python Code Walkthrough - config.py constants, voice ID, custom prompts
Eleven Labs API - Example Request and Response Payloads
Avatars and AI Art Generation with Midjourney, Nvidia Stock Win
Python Code Walkthrough - advisor.py, requirements.txt
Gradio User Interface Development, Microphone Input, Avatar Display
UI Launch, Debugging Mode, Sharing Your App, Mobile Devices
Transcribe Function, OpenAI Whisper API
Incorporating Word Embeddings, Question Vector, Cosine Similarity, Answers
ChatGPT API, Conversation History, Stuffing the Prompt with Context
Eleven Labs API Request with Python, Text to Speech, Voice Synthesis Settings
Outputting Binary Response / MP3 to Audio Output
Final Words of Advice from Jay-Z

Taught by

Part Time Larry

Reviews

Start your review of OpenAI Embeddings with Voice Cloning - Eleven Labs API, ChatGPT API, Whisper API

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.