Overview
Syllabus
Intro
What is Veara
Talk Outline
Search Engine Challenges
Search and AI
History of search
Time
Retrieval
Neural IR
Tokenization
Handling typos
Handling vocabulary tokens
Retrieval models
Training models
Positive signals
Negative mining
Types of systems
Behavioral matching
Search architectures
Alibaba
Conversational search
User expectation of search
Models make stuff up
GP Open AI
Galactica Model
Hallucination
Why do models hallucinate
How are people using these models
Problems with closed book systems
Training our own model
LLM leaderboard
New York Times article
Next steps
Taught by
OpenSource Connections