Training Neural Networks for Temporal Difference Model Predictive Control - Part 2
HuggingFace via YouTube
Overview
Syllabus
- Listing the neural networks we need to train
- What a training batch item looks like
- Forward passes and losses
- Why the latent state representation does not collapse
- Understanding TD Learning
- TD learning intuition in real experiments
- Optimizing the Q network using the TD error
- Offline vs online data collection and training loop
- Wrapping up
Taught by
Hugging Face