Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

Scale R to Big Data with Hadoop & Spark

Data Science Dojo via YouTube

Overview

Learn how to scale R for big data processing using Hadoop and Spark in this 1-hour 10-minute tutorial. Set up a Spark cluster with R installed, wrangle data stored in HDFS using R, and build and deploy machine learning models on large datasets. Discover how to utilize Microsoft R Server to enable distributed computing in R, run native R code via SSH, and set up RStudio server on a cluster. Explore techniques for data manipulation in HDFS, model building on large-scale data, and deploying models to elastically scaled web services for predictions and insights. Gain practical skills to overcome R's traditional limitations with big data and leverage its capabilities throughout the entire data science workflow.

Syllabus

Scale R to Big Data with Hadoop & Spark

Taught by

Data Science Dojo

Reviews

Start your review of Scale R to Big Data with Hadoop & Spark

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.