FastEmbed - Fast and Lightweight Embedding Generation
Qdrant - Vector Database & Search Engine via YouTube
Overview
Learn about efficient embedding generation in this 32-minute technical talk from Vector Space Talks featuring Nirant Kasliwal, AI Engineer at Qdrant and creator of FastEmbed. Discover how this Python library maximizes speed and efficiency through quantized models and ONNX Runtime to achieve superior throughput and latency. Gain insights from Kasliwal's extensive experience as the maintainer of FastEmbed and contributor to the OpenAI Cookbook's Finetuning section. Explore practical implementations of embedding generation techniques that prioritize both performance and usability in modern AI applications.
Syllabus
FastEmbed: Fast & Lightweight Embedding Generation - Nirant Kasliwal | Vector Space Talks #004
Taught by
Qdrant - Vector Database & Search Engine