A Friendly Introduction to Deep Reinforcement Learning, Q-Networks and Policy Gradients

Overview

Coursera Plus Flash Sale: All Certificates & Courses 40% Off. 72 Hours Only!

Grab it

Explore deep reinforcement learning, Q-networks, and policy gradients in this friendly 36-minute video tutorial. Dive into key concepts such as Markov decision processes, rewards, discount factors, and the Bellman equation. Learn about deterministic and stochastic processes before delving into neural networks, including value and policy networks. Understand how to train policy neural networks and gain insights through examples and figures. Perfect for those with a basic understanding of neural networks, this comprehensive guide covers everything from introduction to conclusion, offering a solid foundation in reinforcement learning techniques.

Syllabus

Introduction:
Markov decision processes MDP:
Rewards:
Discount factor:
Bellman equation:
Solving the Bellman equation:
Deterministic vs stochastic processes:
Neural networks:
Value neural networks:
Policy neural networks:
Training the policy neural network:
Conclusion: