Overview
Syllabus
Intro: ChatGPT, Language Models and the Goals of Generalist Robotics Policies
Reading and exploring the data
Creating a Dataset
Creating a Dataset
Creating the transformer encoder
Creating image patches to tokenized
Putting together the VIT
Training the VIT
Making the GRP, starting with adding text inputs
Modifying the data for training
Converting continuous actions to discrete bins
Converting continuous actions to discrete bins
Standardizing the state inputs
Changing to use continuous actions
Standizing the action space
Adding goal images to the transformer
Adding blocked masked attention to use either goal
Scaling training
Training results across A100s
Evaluation using the SimpleEnv robotics simulator
Taught by
Montreal Robotics