Learn how to turn data into information, using one of the world's most popular programming languages.
Overview
Syllabus
Introduction
- Data science: Making sense out of chaos
- What is data science anyway?
- Data science examples
- Data as a business asset
- CRISP-DM: The data science cycle
- Types of problems in data science
- Data formatting in Java
- More data formatting
- Real-life data difficulties
- Mapping
- Filtering
- Collecting
- Sorting
- Challenge: Combining data operations
- Solution: Combining data operations
- Reducing file size
- Loading data from text files
- Creating a person data class
- Converting strings to data objects
- Loading tab-separated files
- Loading CSVs
- Converting CSVs to data objects
- Challenge: Manipulating data
- Solution: Manipulating data
- Setting up JavaFX
- Formatting data for a scatterplot
- Displaying a scatterplot
- Multiple datasets on a scatterplot
- Calculating average MPG
- Displaying a bar chart
- Challenge: Displaying data on a bar chart
- Solution: Displaying data on a bar chart
- Building machine learning models
- Supervised vs. unsupervised learning
- Overfitting and how to avoid it
- K-nearest neighbor basics
- Loading flower data
- Creating a DataItem interface
- Calculating the closest data points
- Implementing the DataItem interface
- Letting your data points vote
- Finishing your KNN classifier
- Naive Bayes basics
- Calculating the possible labels
- Splitting your dataset by label
- Calculating mean and standard deviation
- Calculating datapoint probabilities
Taught by
Shaun Wassell