Running a Question-Answering System on Ray Serve at Deepset

Overview

Explore the process of running a question-answering system on Ray Serve in this 31-minute talk from Anyscale. Delve into the key architectural components of question-answering systems, including data stores, indexing pipelines, and querying pipelines. Learn about Haystack, an open-source framework that connects multiple transformer state-of-the-art NLP models into a single pipeline. Discover how to deploy GPU-empowered inference using Ray, assemble NLP models into pipelines, run Hugging Face models on Ray Serve, deploy NLP model pipelines, and access persistent storage from code deployed on Ray Serve. Gain valuable insights into enhancing your question-answering systems and leveraging Ray Serve for improved performance and scalability.

Syllabus

Running a question-answering system on Ray Serve at Deepset

Taught by

Anyscale

Reviews

Start your review of Running a Question-Answering System on Ray Serve at Deepset

Taught by

Long Form Question Answering in Haystack

A Cheap Trick for Semantic Question Answering for GPU-Challenged Systems

Generative Question-Answering with OpenAI's GPT-3.5 and Davinci

Building World-Class NLP Models with Transformers and Hugging Face

The Geometry of Intelligence in Large Question-Answering Systems

Never Stop Learning.