Completed
- Mirror Descent Policy and MDLOO
Class Central Classrooms beta
YouTube videos curated by Class Central.
Classroom Contents
The Machine Learning Behind Apple Intelligence - Understanding Modern LLM Architecture
Automatically move to the next video in the Classroom when playback concludes
- 1 - Intro
- 2 - Chapter 1 - Overview
- 3 - Pretraining
- 4 - Structured Pruning
- 5 - Knowledge Distillation
- 6 - Post Training
- 7 - Iterative Teaching Committee
- 8 - Chapter 2 - Adapters
- 9 - LoRA Low Rank Adapters
- 10 - Quantization Palettization
- 11 - Chapter 3 - RLHF
- 12 - Reward Modelling
- 13 - Leave One Out
- 14 - Mirror Descent Policy and MDLOO
- 15 - Results