Overview
Syllabus
[] Musical introduction to Rahul Parundekar
[] LLMs in Production Conference announcement
[] Purchase our Swag shirt!
[] Declarative Paradigm
[] Why now?
[] It's great for scalability
[] Most MLOps tools work well with K8s
[] Easy-deploys with tool-provided CRDs
[] Caveats
[] This talk
[] 3 Ways to Serve ML Models
[] Way 1: Serving a Model with an HTTP Endpoint
[] Way 2: Serving the Model with a Message Queue
[] Way 3: Long-running Task that Performs Batch Processing
[] Buil your own container
[] The main predictor 1/2: Singleton with load method
[] The main predictor 2/2: Predict
[] Way 1 5 steps
[] Way 2 2 steps
[] Way 3 2 steps
[] Tests: Sanity check for the model
[] Bringing it together: Entrypoint
[] Continuous Integration CI
[] Create docker-compose.yaml to make it easier for CI
[] On PR: Run tests with Github Actions
[] Branch-protection
[] On PR: Github Actions automatically runs our test
[] On PR: PRs can be then merged on approval
[] Container Repository
[] Continuous Integration CI
[] On merge to main
[] Actions that can constraint
[] TODO
[] Continuous Delivery
[] Argo CD
[] Image promotion with Kustomize
[]
[]
[]
[]
[]
[]
[]
[]
[]
[]
Taught by
MLOps.community