Learn to clean and prepare text data for machine learning models using Python. This course teaches you to apply basic preprocessing steps (lowercasing, punctuation removal, tokenization, stop word removal, and stemming) to the SMS Spam Collection dataset. By the end of this course, you'll have the skills to transform raw text into a format that's ready for NLP tasks.
Syllabus
- Lesson 1: Lowercasing Text for Uniformity in NLP
- Lesson 2: Punctuating Punctuation: Streamlining Text for NLP
- Lesson 3: Tokenizing Text Data in NLP with Python and NLTK
- Lesson 4: Demystifying Stop Words in Natural Language Processing
- Lesson 5: Mastering Stemming in NLP with NLTK
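
To give a feel for how the five lessons fit together, here is a minimal sketch of the whole pipeline in plain Python. It is an illustration under stated assumptions, not the course's own code: the lessons use NLTK, while this sketch sticks to the standard library so it runs without downloads. The `STOP_WORDS` set, the `toy_stem` helper, and the sample message are all hypothetical, and `toy_stem` is a crude suffix stripper standing in for a real stemmer such as NLTK's `PorterStemmer`.

```python
import string

# Hypothetical, deliberately tiny stop word list (an assumption for this sketch);
# the course itself would likely rely on NLTK's much larger list.
STOP_WORDS = {"a", "an", "the", "is", "are", "to", "you", "have", "now", "your"}

# Suffixes the toy stemmer tries, in order.
SUFFIXES = ("ing", "ed", "es", "s")


def toy_stem(word):
    """Crude suffix stripping: a stand-in for a real stemmer, not the Porter algorithm."""
    for suffix in SUFFIXES:
        # Only strip when at least three characters of the word remain.
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def preprocess(text):
    """Apply the five steps from the syllabus, in lesson order."""
    text = text.lower()                                               # Lesson 1: lowercase
    text = text.translate(str.maketrans("", "", string.punctuation))  # Lesson 2: drop punctuation
    tokens = text.split()                                             # Lesson 3: whitespace tokenization
    tokens = [t for t in tokens if t not in STOP_WORDS]               # Lesson 4: remove stop words
    return [toy_stem(t) for t in tokens]                              # Lesson 5: stem


print(preprocess("WINNER!! You have won a prize, claim your rewards now."))
# → ['winner', 'won', 'prize', 'claim', 'reward']
```

Whitespace splitting works here only because punctuation is removed first; a tokenizer such as NLTK's `word_tokenize` handles punctuation and contractions more carefully.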