Indexify Unveiled: A Scalable Content Extraction Engine for Multimodal Data
Qdrant - Vector Database & Search Engine via YouTube
Overview
Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Learn about scalable content extraction for multimodal unstructured data in this technical talk from Tensorlake founder Diptanu Gon Choudhury. Explore the development of Indexify, an open-source engine designed for near real-time knowledge base construction and AI workflows. Discover key insights about compute engines for unstructured data processing, the significance of hybrid search approaches, and the comparative analysis of RAG versus fine-tuning methodologies. Gain practical understanding of how online systems maintain updated indexes for real-world LLM applications, drawing from Diptanu's extensive experience developing cluster schedulers at Hashicorp and Netflix, as well as machine learning platforms at Facebook. Delve into the technical requirements and architectural considerations for building effective retrieval systems and memory components for AI agents.
Syllabus
Indexify Unveiled - Diptanu Gon Choudhury | Vector Space Talk #009
Taught by
Qdrant - Vector Database & Search Engine