Overview
Syllabus
Intro
Big data and ML infra are similar
Speaker background
Why invest in ML infra?
Case study: Building a new TF runtime
ML program as a computational graph
An example ML program
Lifetime of an ML program
Vectorized normalization
A slight digression on Eager execution
ML infra and SQL query processing
(Random) scan-based access patterns
Beyond pure dataflow
ML and DB terminology mapping
Recall graph processing workflow
Expressing input pipelines
Decoupled API and execution
Challenge: Randomized transformations
Graph rewrites
Cost model and data stats
Constraint propagation
Storage/access optimizations
Push vs pull based execution
Distributed and parallel execution
ML infra is like data infra, with new twists
Let's collaborate
Taught by
TensorFlow