Efficient Query Processing for Unstructured Data Using Machine Learning

Efficient Query Processing for Unstructured Data Using Machine Learning

Databricks via YouTube Direct link

Unstructured data is ubiquitous and cheap

2 of 9

2 of 9

Unstructured data is ubiquitous and cheap

Class Central Classrooms beta

YouTube videos curated by Class Central.

Classroom Contents

Efficient Query Processing for Unstructured Data Using Machine Learning

Automatically move to the next video in the Classroom when playback concludes

  1. 1 Intro
  2. 2 Unstructured data is ubiquitous and cheap
  3. 3 ML models can perform well on a range of benchmark tasks
  4. 4 My work: how can we use unreliable and expensive ML models in query processing?
  5. 5 Two key ideas: sampling and proxy scores
  6. 6 Many queries require statistical guarantees on accuracy
  7. 7 Prior work using proxies fail to achieve statistical guarantees on failure probability!
  8. 8 Example query: finding hummingbirds with high recall
  9. 9 Query type two: aggregation Query: "what is the average number of cars per frame?"

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.