Explore the fundamentals of reinforcement learning in this lecture from the Data-Driven Decision Processes Boot Camp. The talk covers three key algorithms, temporal-difference (TD) learning, Q-learning, and natural policy gradient, and presents the core ideas behind obtaining finite-time performance bounds for each. Building on prior knowledge of Markov Decision Processes (MDPs), Rayadurgam Srikant of the University of Illinois Urbana-Champaign gives an in-depth look at the foundations of reinforcement learning, aimed at researchers and practitioners in artificial intelligence and machine learning.