Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

CM3- A Causal Masked Multimodal Model of the Internet

Yannic Kilcher via YouTube

Overview

Explore a comprehensive video featuring a paper explanation and in-depth interview with first author Armen Aghajanyan on CM3: A Causal Masked Multimodal Model of the Internet. Delve into the innovative approach of using HTML structure for machine learning, including text, hyperlinks, and images. Learn about the new Causally Masked Language Modelling training strategy and its implications for autoregressive language models. Discover how CM3 addresses limitations in current models and enables various applications such as text generation, captioning, image creation, and entity linking. Gain insights into the training process of large-scale models, dataset collection, content filtering, and experimental results. Understand the potential impact of CM3 on the future of universal models in artificial intelligence.

Syllabus

- Intro & Overview
- Directly learning the structure of HTML
- Causally Masked Language Modelling
- A short look at how to use this model
- Start of interview
- Feeding language models with HTML
- How to get bi-directionality into decoder-only Transformers?
- Images are just tokens
- How does one train such giant models?
- CM3 results are amazing
- Large-scale dataset collection and content filtering
- More experimental results
- Why don't we use raw HTML?
- Does this paper contain too many things?

Taught by

Yannic Kilcher

Reviews

Start your review of CM3- A Causal Masked Multimodal Model of the Internet

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.