Demystifying Machine Learning in Production - Reasoning about a Large-Scale ML Platform

Demystifying Machine Learning in Production - Reasoning about a Large-Scale ML Platform

USENIX via YouTube Direct link

ML outages from the outside

8 of 15

8 of 15

ML outages from the outside

Class Central Classrooms beta

YouTube videos curated by Class Central.

Classroom Contents

Demystifying Machine Learning in Production - Reasoning about a Large-Scale ML Platform

Automatically move to the next video in the Classroom when playback concludes

  1. 1 Intro
  2. 2 4 things you can do for more reliable ML
  3. 3 ML on one machine
  4. 4 ML in production
  5. 5 What makes ML in prod interesting
  6. 6 What goes wrong?
  7. 7 4 things for more reliable ML
  8. 8 ML outages from the outside
  9. 9 Where changes happen: binaries
  10. 10 Where changes happen: configuration
  11. 11 Validating binary and config changes
  12. 12 Where changes happen: data
  13. 13 Validating data updates
  14. 14 Improving data integrity
  15. 15 Handling pipeline backlogs

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.