Multimodal RAG Application Development with LlamaIndex, Google AI, Gemini Pro and Qdrant
The Machine Learning Engineer via YouTube
Overview
Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Learn to build a multimodal RAG (Retrieval-Augmented Generation) application that extracts information from images in this 25-minute video tutorial. Explore the implementation process using LlamaIndex framework, Google AI Studio SDK, Gemini Pro models, and Qdrant vector database. Follow along with practical demonstrations and access the complete source code through the provided GitHub repository to develop your own image-based information extraction system.
Syllabus
RAG: Multi-modal RAG with Llamaindex, Google Ai, Gemini Pro & Qdrant #machinelearning #datascience
Taught by
The Machine Learning Engineer