Text Data Preprocessing in Python

via CodeSignal

Overview

Learn to clean and prepare text data for machine learning models using Python. This course teaches basic preprocessing steps such as lowercasing, punctuation removal, tokenization, stop word removal, and stemming, and applies them to the SMS Spam Collection dataset. By the end of the course, you'll be able to transform raw text into a format that's ready for NLP tasks.

Syllabus

  • Lesson 1: Lowercasing Text for Uniformity in NLP
  • Lesson 2: Punctuating Punctuation: Streamlining Text for NLP
  • Lesson 3: Tokenizing Text Data in NLP with Python and NLTK
  • Lesson 4: Demystifying Stop Words in Natural Language Processing
  • Lesson 5: Mastering Stemming in NLP with NLTK
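
The five lessons can be read as stages of one pipeline. The sketch below is a minimal illustration of that idea using only the Python standard library, not the course's own code. The lessons use NLTK, whereas here tokenization is a plain whitespace split, `STOP_WORDS` is a tiny hypothetical list, and `naive_stem` is a crude suffix stripper standing in for a real stemmer such as NLTK's `PorterStemmer`. The sample message is made up and is not taken from the SMS Spam Collection dataset.

```python
import string

# Hypothetical, deliberately tiny stop word list for illustration only;
# a real pipeline would use a fuller list such as NLTK's stopwords corpus.
STOP_WORDS = {"a", "an", "the", "is", "are", "to", "you", "your", "now", "and", "of", "for"}

# Crude suffix stripping, standing in for a real stemmer (e.g. NLTK's PorterStemmer).
SUFFIXES = ("ing", "ed", "s")


def naive_stem(token: str) -> str:
    """Strip one common suffix, keeping at least three characters of stem."""
    for suffix in SUFFIXES:
        if token.endswith(suffix) and len(token) - len(suffix) >= 3:
            return token[: -len(suffix)]
    return token


def preprocess(text: str) -> list[str]:
    """Run the five steps in syllabus order and return the cleaned tokens."""
    # Lesson 1: lowercase so "WINNER" and "winner" are treated alike
    lowered = text.lower()
    # Lesson 2: delete every ASCII punctuation character
    no_punct = lowered.translate(str.maketrans("", "", string.punctuation))
    # Lesson 3: tokenize (whitespace split here; the course uses NLTK)
    tokens = no_punct.split()
    # Lesson 4: drop stop words
    kept = [t for t in tokens if t not in STOP_WORDS]
    # Lesson 5: reduce each remaining token to a rough stem
    return [naive_stem(t) for t in kept]


# Made-up spam-style message, used only to exercise the pipeline.
message = "WINNER!! You are selected to receive a prize. Call now!"
print(preprocess(message))
# prints ['winner', 'select', 'receive', 'prize', 'call']
```

The order of the steps matters in this sketch: lowercasing and punctuation removal run before the stop word filter so that "You" and "now!" match the lowercase entries in `STOP_WORDS`.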
