Beyond Lazy Training for Over-parameterized Tensor Decomposition

Beyond Lazy Training for Over-parameterized Tensor Decomposition

Institute for Pure & Applied Mathematics (IPAM) via YouTube Direct link

Two-Layer Neural Network

5 of 15

5 of 15

Two-Layer Neural Network

Class Central Classrooms beta

YouTube videos curated by Class Central.

Classroom Contents

Beyond Lazy Training for Over-parameterized Tensor Decomposition

Automatically move to the next video in the Classroom when playback concludes

  1. 1 Intro
  2. 2 Tensor (CP) decomposition
  3. 3 Why naïve algorithm fails
  4. 4 Why gradient descent?
  5. 5 Two-Layer Neural Network
  6. 6 Form of the objective
  7. 7 Difficulties of analyzing gradient descent
  8. 8 Lazy training fails
  9. 9 O is a high order saddle point
  10. 10 Our (high level) algorithm
  11. 11 Proof ideas
  12. 12 Iterates remain close to correct subspace
  13. 13 Escaping local minima by random correlation
  14. 14 Amplify initial correlation by tensor power method
  15. 15 Conclusions and Open Problems

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.