Overview
Syllabus
Intro
Machine learning and neural networks
3-spin model on the high-dimensional sphere
Neural networks and approximation theory
Functional formulation in the limit of large n
Parameters as particles with loss function as interacting potential
Error scaling - Central Limit Theorem (CLT)
Discrete training set and stochastic gradient descent
Limiting stochastic differential equation for SGD
Dean's equation for particles with correlated noise
Learning with Gaussian kemels
Learning with single layer networks with sigmoid nonlinearity
Taught by
Alan Turing Institute