Overview
Learn how to add speech output to a Jetson Nano project by pairing NVIDIA's inference library, which handles image recognition, with text-to-speech. The lesson shows how to recognize objects in real-time video and speak the name of each identified item aloud. Step-by-step instructions cover setting up the inference engine, building and running threads, creating the sound, and implementing speech output, ending with a program that combines image recognition with text-to-speech.
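As a rough illustration of why the lesson builds a thread for speech, here is a minimal sketch, not the course's actual code. It assumes speaking is slow enough to stall the video loop if done inline, so a background thread does the talking. The names `Announcer`, `announce`, and `say` are hypothetical. `say` stands in for whatever text-to-speech call the program uses, and the labels stand in for results from the inference library.

```python
import queue
import threading


class Announcer:
    """Speak recognized labels on a background thread (illustrative sketch)."""

    def __init__(self, say):
        # `say` is a hypothetical callable that takes a string and speaks it.
        self._say = say
        # Hold at most one waiting label so speech never lags far behind video.
        self._pending = queue.Queue(maxsize=1)
        self._last = None
        self._thread = threading.Thread(target=self._run, daemon=True)
        self._thread.start()

    def _run(self):
        # Worker loop: block until a label arrives, speak it, repeat.
        while True:
            label = self._pending.get()
            if label is None:  # sentinel pushed by close()
                return
            self._say(label)

    def announce(self, label):
        """Queue a label for speaking; return True if it was accepted."""
        if label == self._last:
            return False  # same item as last time, do not repeat it
        try:
            self._pending.put_nowait(label)
        except queue.Full:
            return False  # a label is already waiting, drop this one
        self._last = label
        return True

    def close(self):
        # Ask the worker to stop, then wait for it to finish speaking.
        self._pending.put(None)
        self._thread.join()


if __name__ == "__main__":
    announcer = Announcer(print)  # print stands in for a text-to-speech call
    for label in ["cat", "cat", "dog"]:  # stand-in for per-frame results
        announcer.announce(label)
    announcer.close()
```

In a real program, the loop at the bottom would be the camera loop, and `say` would wrap the chosen text-to-speech engine. Dropping labels when the queue is full is one possible design choice. It keeps speech close to what the camera currently sees, at the cost of skipping some items.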
Syllabus
Introduction
Start a new program
Write the code
Inference Engine Setup
First Run
Building the Thread
Running the Thread
Creating the Sound
Speaking
Testing
Taught by
Paul McWhorter