Syllabus
Intro
General-Purpose Learning Algorithms
Deep Blue vs. Garry Kasparov
Reinforcement Learning Framework
Grounded Cognition
Deep Reinforcement Learning
DQN: A General Atari Player
Adding Memory to Neural Networks
Memory: Neural Turing Machines
3D Environments - Navigation
The History of Go
A repeated board position is not allowed
Evaluation function for Go
Intuition vs Calculation
Training the deep neural networks
Two networks: Policy and Value Nets
Combining Neural Nets with Tree Search and Rollouts
Cutting Down the Search Tree
Evaluating AlphaGo against computers
Google DeepMind Challenge Match: 9-15 March
Note on Compute Power
We won the match 4-1
Game 2 - AlphaGo's Move 37
Cultural Impact of the match
Rate of Progress: ~1 rank per month
Intuition and Creativity
Taught by
MITCBMM