Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

LinkedIn Learning

Build an Image Captioning Tool for Visually Impaired Users with Gemini

via LinkedIn Learning

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Find out how artificial intelligence can help you make better web experiences for visually impaired users.

Syllabus

Introduction
  • Image captioning with AI
  • What you should know
  • Who this course is for
1. Setting Up Access to Gemini API
  • Understanding Gemini models
  • Gemini pricing
  • Signing up for an Google AI Studio account
  • Getting your API key
2. Building the Interface
  • Cloning the seed project
  • Project code walkthrough
  • Adding the image upload functionality
  • Adding the prompt functionality
  • Writing the caption display
3. Building the Backend: Connecting to Gemini
  • Building out the Express.js API
  • Configuring the Generative AI SDK
  • Adding routes
  • Setting up file upload functionality
  • Writing the prompt request and response
4. Bringing It All Together
  • Connecting the frontend to the API
  • Adding a progress indicator
  • Using the Web Speech API to read captions
Conclusion
  • Next steps

Taught by

Fikayo Adepoju

Reviews

5 rating at LinkedIn Learning based on 5 ratings

Start your review of Build an Image Captioning Tool for Visually Impaired Users with Gemini

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.