Overview
Syllabus
Intro
Sequential Decision Making in Complex Environments
Data from the Existing Simulation Environments
Procedural Generation of New Environments
Benchmarking RL Generalization
Benchmarking Safe Reinforcement Learning
Benchmarking Multi-Agent Reinforcement Learning
Real2Sim: Learning to generate traffic scenarios
Pretraining Policy Representation with Real World Data
Self-supervised Learning through Contrastive Learning
Policy Pretraining with Human Actions
Action-conditioned Contrastive Learning
Pretrained Representation for Imitation Learning
Human-in-the-loop Reinforcement Learning
Human-Al Copilot Optimization (HACO)
Demo Video: Learning to drive in CARLA environment
Policy Dissection through Frequency Analysis
Taught by
Bolei Zhou