Overview
Syllabus
Intro
Mapping Instructions to Actions
Mapping Instructions to Control • Drone state is determined by a configuration of target velocities
Modular Approach
Single-model Approach
Visitation Distribution for Navigation
Simulation-Reality Joint Learning
Training Data
Learning Architecture Supervised Learning Learning
RL for Control
Reward Goals
SuReAL Supervised and Reinforcement Asynchronous Learning
Experimental Setup
Human Evaluation
Game Environment
Two Players
Game Objective
Action and Dynamic
The CerealBar Scenario
The Task Goal: learn to map leader instructions to follower actions
Instruction Following in Collaborations
Learning with Error Propagation
Generating Error Recovery Examples
Generated Recovery Example
Data Collection
Data Statistics
Data Under-utilization
Cascaded Evaluation Metrics
Results: Held-out Games
Taught by
Paul G. Allen School