Overview
Explore cutting-edge advancements in 3D object manipulation through this insightful conference talk on View Transformers. Delve into the development and capabilities of RVT and RVT-2, state-of-the-art multi-view transformer models designed for complex 3D manipulation tasks in robotics. Learn how these models predict gripper pose actions using camera images and task descriptions, offering significant improvements over existing methods. Discover the impressive performance gains, including faster training times, improved task success rates, and enhanced real-world applicability with minimal demonstrations. Gain insights into how RVT-2 addresses challenges in high-precision tasks, setting new benchmarks on the RLBench dataset and demonstrating strong real-world performance. Presented by Ankit Goyal, a Research Scientist in Robotics at NVIDIA, this talk provides valuable knowledge for those interested in the latest developments in robotic manipulation and computer vision.
Syllabus
Ankit Goyal: View Transformers for 3D Manipulation in Robotics
Taught by
Montreal Robotics