Completed
[] Best at summarizing long text open-source models
Class Central Classrooms beta
YouTube videos curated by Class Central.
Classroom Contents
Evaluating LLM Applications - Insights from Shahul Es
Automatically move to the next video in the Classroom when playback concludes
- 1 [] Shahul's preferred coffee
- 2 [] Takeaways
- 3 [] Please like, share, and subscribe to our MLOps channels!
- 4 [] Shahul's definition of Evaluation
- 5 [] Evaluation metrics and Benchmarks
- 6 [] Gamed leaderboards
- 7 [] Best at summarizing long text open-source models
- 8 [] Benchmarks
- 9 [] Recommending evaluation process
- 10 [] LLMs for other LLMs
- 11 [] Debugging failed evaluation models
- 12 [] Prompt injection
- 13 [] Alignment
- 14 [] Open Assist
- 15 [] Garbage in, garbage out
- 16 [] Ragas
- 17 [] Valuable use case besides Open AI
- 18 [] Fine-tuning LLMs
- 19 [] Connect with Shahul if you need help with Ragas @Shahules786 on Twitter
- 20 [] Wrap up