Overview
Syllabus
- Intro
- Sponsor: Assembly AI
- Start of Interview
- What's the pitch?
- How did data bootstrapping come into the project?
- How big of a problem is data quality?
- Are the captioning & filtering models biased towards COCO data?
- Could the data bootstrapping be done multiple times?
- What was the evolution of the BLIP architecture?
- Are there additional benefits to adding language modelling?
- Can we imagine a modular future for pre-training?
- Diving into the experimental results
- What did and did not work out during the research?
- How is research life at Salesforce?
- Where do we go from here?
Taught by
Yannic Kilcher