Online k-means Clustering on Arbitrary Data Streams - Lecture
USC Probability and Statistics Seminar via YouTube
Overview
Syllabus
Intro
k-means clustering
The online setting
The Goal(s) Cohen-Addad et. al. 2021
A troubling example
Lower Bound
A natural starting point: streaming
A difficult case for streaming
Idea 1: don't remove centers
Proof Sketch
We still have problems on pathological examples.
Idea 2: Using the scale to delete points.
The Lemma Revisited
Our Algorithm's Performance
Proof idea
Future Directions
Taught by
USC Probability and Statistics Seminar