Overview
Syllabus
[] Simon's preferred beverage
[] Takeaways
[] Simon's tech background
[] Zombie models garbage collection
[] The road to LLMs
[] Trained models Simon worked on
[] LLM Checkpoints
[] Confidence in AI Training
[] Different Checkpoints
[] Checkpoint parts
[] Slurm vs Kubernetes
[] Storage choices lessons
[] Paramount components for setup
[] Argo workflows
[] Kubernetes node troubleshooting
[] Cloud virtual machines have pre-installed monitoring
[] Fine-tuning
[] Storage, networking, and complexity in network design
[] Start simple before going advanced; consider model needs
[] Join us at our first in-person conference on June 25 all about AI Quality
Taught by
MLOps.community