Overview
Syllabus
Introduction
Search interface 20
Analytics interface of today
Optimised for real time
Worked examples
UK Housing data: percentiles
UK Housing data: terms
Geo as a common link between datasets: housing crime
Connected data: Enron emails
Recommendations: MovieLens data
Random samples should hold no surprises
Non random sample: people who liked Talladega nights
Problem: avoid analysis of poorly focused sets
How do we get a smaller, representative sample of users?
Putting search and analytics together..
Amazon marketplace reviews
Anatomy of an entity indexing groovy script
Drilling down into seller #187's fanboys
UK car roadworthiness test: raw data
Derived car attributes
Miles driven vs number of days for fix
In summary
Questions?
Taught by
GOTO Conferences