Overview
Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Explore an in-depth analysis of the DETR (DEtection TRansformer) paper, which introduces a revolutionary approach to object detection using transformers. Learn how this innovative method streamlines the detection pipeline by eliminating the need for hand-designed components like non-maximum suppression and anchor generation. Discover the key elements of the DETR framework, including its set-based global loss and transformer encoder-decoder architecture. Understand how the model uses learned object queries to reason about object relations and global image context, producing final predictions in parallel. Examine the architecture in detail, delving into the bipartite match loss function, object queries, and transformer properties. Compare DETR's performance to established baselines on the COCO dataset and explore its potential for panoptic segmentation. Gain insights into this conceptually simple yet powerful approach that achieves comparable accuracy and run-time performance to highly-optimized object detection models.
Syllabus
- Intro & High-Level Overview
- Problem Formulation
- Architecture Overview
- Bipartite Match Loss Function
- Architecture in Detail
- Object Queries
- Transformer Properties
- Results
Taught by
Yannic Kilcher