Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

Data Pipelines - Introduction to Text Analytics with R Part 3

Data Science Dojo via YouTube

Overview

Explore data pipelines in this third installment of the introduction to text analytics with R video. Dive into textual data exploration for pre-processing challenges, utilize the quanteda package for text analytics, and create a prototypical text analytics pre-processing pipeline. Learn about tokenization, lower casing, stop word removal, and stemming. Develop skills to create a document-frequency matrix used for training machine learning models. Access the Kaggle dataset and R code used in the series to practice hands-on. Gain valuable insights into text analytics techniques and their application in data science projects.

Syllabus

Intro
HTML Escapes
Quantium
Tokenization
Tokens
Stop Words
Quantity
Stem
DFM

Taught by

Data Science Dojo

Reviews

Start your review of Data Pipelines - Introduction to Text Analytics with R Part 3

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.