BloombergGPT - Building a 50 Billion Parameter Financial Language Model
Toronto Machine Learning Series (TMLS) via YouTube
Overview
Explore the development process of BloombergGPT, a groundbreaking 50 billion parameter language model specifically designed for finance, in this insightful conference talk. Discover how the model was trained on a unique combination of general-purpose datasets and diverse financial documents from Bloomberg archives. Learn about the challenges faced during the training process, including loss spikes, unexpected parameter drifts, and performance plateaus, and how the team overcame these obstacles. Gain valuable insights into the specific hurdles encountered when building large language models (LLMs) and receive guidance on whether to embark on your own LLM journey. Understand how BloombergGPT outperforms existing models on financial tasks while maintaining competitive performance on general LLM benchmarks. Explore real-world examples that demonstrate how BloombergGPT distinguishes itself from general-purpose models in the financial domain.
Syllabus
BloombergGPT: How We Built a 50 Billion Parameter Financial Language Model
Taught by
Toronto Machine Learning Series (TMLS)