Evaluating LLM Applications - Insights from Shahul Es

Evaluating LLM Applications - Insights from Shahul Es

MLOps.community via YouTube Direct link

[] Benchmarks

8 of 20

8 of 20

[] Benchmarks

Class Central Classrooms beta

YouTube videos curated by Class Central.

Classroom Contents

Evaluating LLM Applications - Insights from Shahul Es

Automatically move to the next video in the Classroom when playback concludes

  1. 1 [] Shahul's preferred coffee
  2. 2 [] Takeaways
  3. 3 [] Please like, share, and subscribe to our MLOps channels!
  4. 4 [] Shahul's definition of Evaluation
  5. 5 [] Evaluation metrics and Benchmarks
  6. 6 [] Gamed leaderboards
  7. 7 [] Best at summarizing long text open-source models
  8. 8 [] Benchmarks
  9. 9 [] Recommending evaluation process
  10. 10 [] LLMs for other LLMs
  11. 11 [] Debugging failed evaluation models
  12. 12 [] Prompt injection
  13. 13 [] Alignment
  14. 14 [] Open Assist
  15. 15 [] Garbage in, garbage out
  16. 16 [] Ragas
  17. 17 [] Valuable use case besides Open AI
  18. 18 [] Fine-tuning LLMs
  19. 19 [] Connect with Shahul if you need help with Ragas @Shahules786 on Twitter
  20. 20 [] Wrap up

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.