Sequence Modeling with Neural Networks
Alexander Amini and Massachusetts Institute of Technology via YouTube
Overview
Syllabus
Intro
What is a sequence?
a sequence modeling problem
idea: use a fixed window
problem: we can't model long-term dependencies
idea: use entire sequence, as a set of counts
idea: use a really big fixed window
problem: no parameter sharing
to model sequences, we need
example network
RNNS remember their previous state
"unfolding" the RNN across time
remember: backpropagation
let's try it out for W with the chain rule
backpropagation through time
problem: vanishing gradient
activation functions
initialization
gated cells
possible task: music generation
possible task: machine translation
problem: a single encoding is limiting
solution: attend over all encoder states
Taught by
https://www.youtube.com/@AAmini/videos