Overview
Jump-start your data science career by learning how to install and work with several essential data science tools, including Proxmox, Hadoop, Spark, and Weka.
Syllabus
Introduction
- What data science tools must you know?
- Course organization
Introduction
- Data science
- Fundamental skills
- Tools of trade
- Enabling technologies
Cloud computing and virtualization
- Cloud fundamentals
- Types of cloud
- Solution providers
- Private cloud hands-on with Proxmox
- Proxmox: Bootable installation disk
- Proxmox: Installation
- Proxmox: Managing virtual machines
- Proxmox: Creating and configuring virtual machines
Distributed file systems
- Fundamentals
- Distributed systems and distributed processing
- Hadoop hands-on
- Hadoop: Preparation
- Hadoop: Installation
- Hadoop: MapReduce hands-on
- Distributed processing with MapReduce
- Distributed processing with Spark
- Spark architecture and features
- Spark: Installation
- Spark: Spark shell
- Spark: pyspark
- Spark: Application
Machine learning
- Fundamentals
- Types of machine learning
- Weka: Installation
- Weka: GUI
- Weka: Training vs. testing
- Weka: Clustering
Putting it all together
- Hadoop cluster: Installation
- Hadoop cluster: Operation
- Spark, YARN, and Hadoop
- Weka and Spark
- Next steps
Taught by
Jungwoo Ryoo