Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

Building Multimodal AI RAG with LlamaIndex, NVIDIA NIM, and Milvus - LLM App Development

Nvidia via YouTube

Overview

Explore the process of building a multimodal AI retrieval-augmented generation (RAG) application in this 17-minute video tutorial. Learn how to convert documents into text using vision language models like NeVA 22B and DePlot, utilize GPU-accelerated Milvus for efficient embedding storage and retrieval, leverage NVIDIA NIM API's Llama 3 model for handling user queries, and seamlessly integrate all components with LlamaIndex. Gain practical insights into document processing, vector database management, inference techniques, and orchestration for creating a smooth Q&A experience. Access the accompanying notebook for hands-on practice and join the NVIDIA Developer Program for additional resources. Discover how to combine cutting-edge technologies such as LangChain, Mixtral, and NIM APIs to develop advanced LLM applications.

Syllabus

Building Multimodal AI RAG with LlamaIndex, NVIDIA NIM, and Milvus | LLM App Development

Taught by

NVIDIA Developer

Reviews

Start your review of Building Multimodal AI RAG with LlamaIndex, NVIDIA NIM, and Milvus - LLM App Development

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.