Unified-IO: A Unified Model for Vision, Language, and Multi-Modal Tasks
USC Information Sciences Institute via YouTube
Syllabus
Intro
Single-Task Model vs. Unified Model
Single-Task Model for Vision
Image Output Quantization
Text Input for Different Tasks
Model Details
Objective
Dataset and Implementations
Pre-training Distribution
Evaluation
GRIT Requires Diverse Skills
Results
Semantic Segmentation
Depth Estimation
Object Detection
Image Inpainting
Segmentation-Based Image Generation
Summary
Task Distribution
Taught by
USC Information Sciences Institute