Overview
Embark on an advanced step-by-step tutorial to deploy a Python RAG/AI project to the AWS cloud, transforming it into a public API hosted on AWS Lambda for scalability and high performance. Dive into RAG concepts, explore project architecture, and integrate FastAPI. Master Docker image building, implement deployment hacks, and conduct local testing. Learn to construct AWS infrastructure using CDK and create an asynchronous API. Access the provided GitHub repository for code references and explore related videos covering RAG basics, FastAPI, AWS fundamentals, and Docker on Lambda to enhance your understanding of cloud deployment strategies for AI applications.
Syllabus
- Introduction
- RAG Recap
- Project Architecture
- Adding FastAPI
- Building a Docker Image
- Deployment Hacks
- Local Testing With Docker
- Build AWS Infrastructure with CDK
- Creating an Async API
- Wrapping Up
Taught by
pixegami