Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

Deploy RAG/AI App to AWS Cloud - Step-by-Step Tutorial

pixegami via YouTube

Overview

Embark on an advanced step-by-step tutorial to deploy a Python RAG/AI project to the AWS cloud, transforming it into a public API hosted on AWS Lambda for scalability and high performance. Dive into RAG concepts, explore project architecture, and integrate FastAPI. Master Docker image building, implement deployment hacks, and conduct local testing. Learn to construct AWS infrastructure using CDK and create an asynchronous API. Access the provided GitHub repository for code references and explore related videos covering RAG basics, FastAPI, AWS fundamentals, and Docker on Lambda to enhance your understanding of cloud deployment strategies for AI applications.

Syllabus

- Introduction
- RAG Recap
- Project Architecture
- Adding FastAPI
- Building a Docker Image
- Deployment Hacks
- Local Testing With Docker
- Build AWS Infrastructure with CDK
- Creating an Async API
- Wrapping Up

Taught by

pixegami

Reviews

Start your review of Deploy RAG/AI App to AWS Cloud - Step-by-Step Tutorial

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.