A Friendly Introduction to Deep Reinforcement Learning, Q-Networks and Policy Gradients

A Friendly Introduction to Deep Reinforcement Learning, Q-Networks and Policy Gradients

Serrano.Academy via YouTube Direct link

Markov decision processes MDP:

2 of 12

2 of 12

Markov decision processes MDP:

Class Central Classrooms beta

YouTube videos curated by Class Central.

Classroom Contents

A Friendly Introduction to Deep Reinforcement Learning, Q-Networks and Policy Gradients

Automatically move to the next video in the Classroom when playback concludes

  1. 1 Introduction:
  2. 2 Markov decision processes MDP:
  3. 3 Rewards:
  4. 4 Discount factor:
  5. 5 Bellman equation:
  6. 6 Solving the Bellman equation:
  7. 7 Deterministic vs stochastic processes:
  8. 8 Neural networks:
  9. 9 Value neural networks:
  10. 10 Policy neural networks:
  11. 11 Training the policy neural network:
  12. 12 Conclusion:

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.