What is "ChatGPT"? How does it work? How does it relate to all the other buzzwords, like "transformers", "large language models", and "autoregressive decoding"? Let's break it all down.
This "how it works" class shows you how ChatGPT works from the inside out. In particular, we cover the general technology behind it, known more broadly as "Large Language Models". We’ll cover the following topics and takeaways:
- What Large Language Models are, and how they relate to ChatGPT
- How neural networks process and generate text
- Concepts for processing language, such as word2vec
- How a transformer processes and generates text of any length
- Critical concepts in transformers, such as autoregressive decoding
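
As a taste of that last concept, here is a minimal sketch of autoregressive decoding: generate one word at a time, then feed the output back in as input for the next step. The lookup table `NEXT_WORD` and the functions `predict_next` and `generate` are hypothetical stand-ins invented for illustration. A real Large Language Model replaces the table with a neural network that looks at all previous words, not just the last one.

```python
# Toy next-word table standing in for a trained model (made-up data).
NEXT_WORD = {
    "<start>": "large",
    "large": "language",
    "language": "models",
    "models": "generate",
    "generate": "text",
    "text": "<end>",
}


def predict_next(tokens):
    """Hypothetical stand-in for a model: looks only at the last word."""
    return NEXT_WORD.get(tokens[-1], "<end>")


def generate(max_tokens=10):
    """Autoregressive loop: each prediction is appended and becomes input."""
    tokens = ["<start>"]
    for _ in range(max_tokens):
        next_token = predict_next(tokens)
        if next_token == "<end>":
            break  # the "model" signalled that the text is finished
        tokens.append(next_token)  # feed the output back in as input
    return " ".join(tokens[1:])  # drop the <start> marker


print(generate())
```

The loop is the important part: the text grows one word per step, which is why this style of generation can, in principle, keep going until the model emits a stop signal or a length limit is reached.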
This class does not require any prior knowledge beyond my Artificial Intelligence for Beginners course. Regardless of your background, you’ll walk away with the fundamentals for discussing and learning more about Large Language Models.
Interested in more machine learning? Try my Computer Vision 101 (Applied ML) classes.
Interested in learning how to code? Check out my Coding 101 (Python), OOP 101 (Python), or VR101 (HTML) class.
Interested in data science? Check out my SQL 101 (Database Design) or Data 101 (Analytics) class.