Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

Multimodal Embeddings - Introduction and Use Cases with Python

Shaw Talebi via YouTube

Overview

Learn about multimodal embeddings in this 25-minute technical video that explores how different data types can be represented in the same vector space. Dive into the fundamentals of embeddings before exploring how contrastive learning enables the creation of multimodal embedding spaces. Follow along with Python-based demonstrations of two practical applications: zero-shot image classification and image search systems. Access complementary resources including a detailed blog post and GitHub repository with implementation code. Explore key concepts through a structured progression from basic embedding principles to advanced multimodal applications, supported by references to foundational papers like BERT, ViT, and CLIP. Gain insights into the future directions of multimodal AI while building practical understanding through hands-on examples.

Syllabus

Introduction -
What are embeddings? -
Multimodal Embeddings -
Contrastive Learning -
Contrastive Learning Details -
Example 1: 0-shot Image Classification -
Example 2: Image Search -
What's Next? -

Taught by

Shaw Talebi

Reviews

Start your review of Multimodal Embeddings - Introduction and Use Cases with Python

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.