Overview
Explore the challenges and lessons learned from training large AI language models in this 30-minute conference talk from MLOps World: Machine Learning in Production. Gain insights from Bandish Shah, Engineering Manager at MosaicML/Databricks, as he shares experiences in handling massive datasets, designing effective model architectures, optimizing training procedures, and managing computational resources across hundreds of GPUs. Discover the intricacies of training models with billions of parameters and delve into the "sausage making" behind large language models. Suitable for ML researchers, practitioners, and those curious about the complexities of training advanced AI systems.
Syllabus
Training LLMs: Lessons from the Trenches
Taught by
MLOps World: Machine Learning in Production