Overview
Syllabus
Introduction
About Qualcomm AI Research
Challenges with AI workloads
Model efficiency pipeline
Challenges
DONNA
Fourstep process
Example
Blocks
Models
Accuracy predictor
Yields
Linear regression
Evolutionary search
Evolutionary sampling
Finetuning
Results
Model pruning
Unstructured pruning
Structured compression
Main takeaway
Quantization research
Quantization
Recent papers
Adaptive rounding
AI model efficiency tool
Key results
Highlevel view
Mixed precision
Mixed precision on a chip
APQ
Running networks conditionally
Classification example
Multiscale dense nets
Semantic segmentation
Dynamic convolutions
Video processing
Skip convolutions
Video classification
Summary
Questions
Sponsors
Taught by
tinyML