Overview
Watch a 15-minute technical video exploring reinforcement learning applications in multi-agent systems, focusing on two key examples: optimizing reward functions for a fleet of New York taxis and implementing swarm intelligence for 100 drones navigating Jupiter's atmosphere. Learn about transformer-based reward function optimization, environmental interaction learning, and the challenges of multi-agent reinforcement learning in complex scenarios. Delve into game theory principles applied to robotic swarm behavior while examining fundamental limitations and open problems in reinforcement learning from human feedback, referencing current research from recent arxiv publications.
Syllabus
Robotics Policy Optimization on 100 drones (game theory)
Taught by
Discover AI