OpenAI - Solving Rubik's Cube with a Robot Hand - RL Paper Explained
Aleksa Gordić - The AI Epiphany via YouTube
Overview
Syllabus
Intro
Comparison with Dactyl system
High-level overview
Tasks Rubik's cube and block reorientation
Physical system overview
Reading angles from the cube electronics
Realistic modeling of the system in simulation
Automatic Domain Randomization ADR
Cube size randomization during training blog
Entropy and rand param probability distribution
ADR pseudocode
Rapid
Randomizations
PPO
Actions and rewards
Policy network, embed and add
Behavioural cloning
Vision pipeline
Focal loss
Results
Perturbation robustness
Meta-learning
Predicting environment variables from LSTM hidden state
Taught by
Aleksa Gordić - The AI Epiphany