AlphaGo - Mastering the Game of Go with Deep Neural Networks and Tree Search - RL Paper Explained
Aleksa Gordić - The AI Epiphany via YouTube
Overview
Syllabus
Intro
Context behind the game of Go
High-level overview of components - SL policies
RL policy network
The value network
Going deeper
Details around value network
Understanding the search MTCS
Evaluation of AlphaGo
Older techniques
Even more detailed explanation of APV-MTCS
Virtual loss
Engineering
Neural networks and symmetries
Taught by
Aleksa Gordić - The AI Epiphany