Overview
Syllabus
Intro
Data science with Python is hot
We're different
Let's do something together: sort the EuroPython site abstracts
Why we love numpy: 100 000 term frequency vs inverse doc frequency
Arrays are nothing but pointers: a numpy array
Array computing is fast
Array computing is limited by CPU starvation
Numerics versus control flow: what if there is an if?
Numerics versus databases
Operations on chunks, or algorithms on chunks: machine learning, data mining = numerics
Making the data-science magic happen
Data/computation flow is crucial
Ingredients for future data flows
The Python VM is great
Scikit-learn is easy machine learning: as easy as py
Difference is richness, but requires outreach
Taught by: EuroPython Conference