Explore a comprehensive research seminar that delves into the intricacies of imitation learning (IL) and behavior cloning (BC) in sequential decision-making tasks. Learn how behavior cloning with logarithmic loss can achieve horizon-independent sample complexity in offline IL under specific conditions, challenging previous assumptions about the gap between offline and online imitation learning. Discover the implications for applications in robotics, autonomous driving, and autoregressive language generation through the analysis presented by Microsoft Research principal researcher Dylan Foster. Gain insights into learning-theoretic perspectives on policy classes, including deep neural networks, and understand how controlled cumulative payoffs and supervised learning complexity influence the effectiveness of imitation learning approaches.
Is Behavior Cloning All You Need? Understanding Horizon in Imitation Learning
Harvard CMSA via YouTube
Overview
Syllabus
Dylan Foster | Is Behavior Cloning All You Need? Understanding Horizon in Imitation Learning
Taught by
Harvard CMSA