DeepMind's AlphaGo Zero and AlphaZero - RL Paper Explained
Aleksa Gordić - The AI Epiphany via YouTube
Syllabus
- AlphaGo lineage of agents
- Comparing AlphaGo Zero with AlphaGo
- High-level explanation of AlphaGo Zero inner workings
- MCTS recap
- Training details and curves
- Architecture impact
- Knowledge acquired
- Results
- Discovering joseki
- Human domain knowledge in AlphaGo Zero
- Pipeline overview
- Self-play thread explained
- Further details: PUCT recap, etc.
- AlphaZero: what's new?
Taught by
Aleksa Gordić - The AI Epiphany