Overview
Syllabus
intro
starter code walkthrough
let’s fix the learning rate plot
pytorchifying our code: layers, containers, torch.nn, fun bugs
overview: WaveNet
dataset bump the context size to 8
re-running baseline code on block_size 8
implementing WaveNet
training the WaveNet: first pass
fixing batchnorm1d bug
re-training WaveNet with bug fix
scaling up our WaveNet
experimental harness
WaveNet but with “dilated causal convolutions”
torch.nn
the development process of building deep neural nets
going forward
improve on my loss! how far can we improve a WaveNet on this data?
Taught by
Andrej Karpathy