Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

Codecademy

Intro to Language Models in Python

via Codecademy

Overview

Build the basic language models in Python.
From spam filters to ChatGPT, computer and AI systems have to work with language models all the time. In this course, you will learn how to build and work with the most common language models in data science, including bag-of-words, tf-idf, and word embeddings. Learn the basic skills you need before working with more advanced AI language models.


* Create common language models

* Measure similarity between words

* Determine important words in documents


### Notes on Prerequisites
This course requires a basic knowledge of Python. If you are a beginner, start with [Learn Python for Data Science](https://www.codecademy.com/learn/paths/learn-python-for-data-science) or learn to build language models from scratch in our [Build Chatbots with Python](https://www.codecademy.com/learn/paths/build-chatbots-with-python) Skill Path.

Syllabus

  • Welcome to Language Models with Python: A brief overview of what you will learn in this course.
    • Informational: Welcome to Language Models with Python
  • Bag-of-Words Language Model: Whet your language model appetite with the widely used Bag-of-Words. Develop the underlying functionality in Python, then use scikit-learn.
    • Lesson: Bag-of-Words Language Model
    • ExternalResource: Working with Text Data | scikit-learn
    • Project: Mystery Friend
    • Quiz: Bag-of-words language model
  • Term Frequency-Inverse Document Frequency: Rethink topic models with term frequency-inverse document frequency (tf-idf), which adjusts the importance of words within a document.
    • Lesson: Term Frequency–Inverse Document Frequency
    • ExternalResource: Working with Text Data | scikit-learn | From Occurrences to Frequencies
    • Project: Read the News Analysis
    • Quiz: Term Frequency-Inverse Document Frequency
  • Word Embeddings: Quantify meaning based on context using word embeddings.
    • Lesson: Word Embeddings
    • ExternalResource: Token.similarity | spaCy
    • Project: U.S.A. Presidential Vocabulary
    • Quiz: Word Embeddings
    • Informational: What's Next

Taught by

Alex DiStasi

Reviews

4.3 rating at Codecademy based on 43 ratings

Start your review of Intro to Language Models in Python

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.