Overview
Explore a hobby project that narrates the world in real time for blind and visually impaired people using generative AI and computer vision. Learn how the "Be My Eyes" project uses object detection and computer vision models from OpenAI to give people with visual impairments access to more of their surroundings. Discover the simple set-up: a video camera continuously records the user's surroundings and feeds the footage in real time to an AI model trained to analyze and interpret visual content. Understand how the system converts the model's textual narration of each scene into an audio description via a text-to-speech model, making the visual world accessible to those who cannot see it.
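The camera-to-model-to-speech flow described in the overview can be pictured as a small loop. What follows is only a minimal sketch under stated assumptions, not the project's actual code: the `Narrator` class, the `describe` and `speak` callables, the `every_nth` sampling parameter, and the `fake_describe` stub are all hypothetical names introduced for illustration. A real system would replace the stubs with calls to a vision model and a text-to-speech model.

```python
from dataclasses import dataclass, field
from typing import Callable, Iterable, List

# Hypothetical type aliases: the talk does not name concrete interfaces.
Frame = bytes
Describe = Callable[[Frame], str]  # stands in for the vision model call
Speak = Callable[[str], None]      # stands in for the text-to-speech call


@dataclass
class Narrator:
    describe: Describe
    speak: Speak
    every_nth: int = 30  # assumed sampling rate: describe one frame in 30
    last_spoken: str = field(default="", init=False)

    def run(self, frames: Iterable[Frame]) -> int:
        """Narrate sampled frames; return how many descriptions were spoken."""
        spoken = 0
        for index, frame in enumerate(frames):
            if index % self.every_nth != 0:
                continue  # skip frames between samples to limit model calls
            text = self.describe(frame).strip()
            if not text or text == self.last_spoken:
                continue  # do not repeat the description of an unchanged scene
            self.speak(text)
            self.last_spoken = text
            spoken += 1
        return spoken


def fake_describe(frame: Frame) -> str:
    # Stub: a real system would send the frame to a vision model here.
    return f"Scene with {len(frame)} bytes of image data"


if __name__ == "__main__":
    heard: List[str] = []  # collects text instead of playing audio
    narrator = Narrator(describe=fake_describe, speak=heard.append, every_nth=2)
    narrator.run([b"a", b"bb", b"c", b"dd", b"eee"])
    print(heard)
```

Passing the model and speech steps in as plain callables keeps the loop testable without a camera, network access, or audio hardware. Sampling frames and skipping repeated descriptions are assumed design choices to keep narration from flooding the listener.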
Syllabus
Be My Eyes - Leveraging Generative AI to help the blind and visually impaired people by Danushka
Taught by
Devoxx