OpenAI Transcription API

via Pluralsight

Overview

OpenAI's Whisper model offers speech-to-text transcription and translation that can convert recorded business
communications into text. This course will teach you how to use Whisper to solve your speech archiving and
content analysis problems.

Audio archives take up a lot of storage space and can be difficult to catalog and understand. In this course, OpenAI Transcription API, you'll learn to use OpenAI's Whisper service to convert your speech content into more manageable text formats. First, you'll explore Whisper's available models and endpoints. Next, you'll discover the code you need to invoke a model through the API. Finally, you'll learn how to adjust your transcriptions and translations to balance functionality against operational cost. When you're finished with this course, you'll have the skills and knowledge of Whisper needed to manage your audio resources.
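
As a rough illustration of what invoking a model through the API can look like, here is a minimal Python sketch. It is not taken from the course. It assumes the official `openai` Python package, whose client exposes `audio.transcriptions.create` and `audio.translations.create`, and the `whisper-1` model name; the helper name `transcribe` and the file name `meeting.mp3` are hypothetical.

```python
from pathlib import Path


def transcribe(client, audio_path, translate=False, model="whisper-1", **options):
    """Return the text of an audio file using an OpenAI-style client.

    Assumption: `client` exposes `audio.transcriptions.create` and
    `audio.translations.create`, as the official `openai` SDK does.
    Extra keyword arguments (for example `response_format`) are passed
    through to the API call unchanged.
    """
    # Translation produces English text; transcription keeps the spoken language.
    endpoint = client.audio.translations if translate else client.audio.transcriptions
    with Path(audio_path).open("rb") as audio_file:
        result = endpoint.create(model=model, file=audio_file, **options)
    return result.text


# Example usage (needs `pip install openai` and the OPENAI_API_KEY variable):
#   from openai import OpenAI
#   client = OpenAI()
#   print(transcribe(client, "meeting.mp3"))                  # hypothetical file
#   print(transcribe(client, "meeting.mp3", translate=True))  # English output
```

Passing the client in as an argument keeps the helper easy to exercise with a stand-in object, so the logic can be checked without making billable API calls.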

Syllabus

  • Course Overview (1 min)
  • Understanding the OpenAI Whisper Service (8 mins)
  • Using the Whisper API to Transcribe and Translate Speech (13 mins)

Taught by

David Clinton
