Learn how to use Hive to analyze large datasets and derive information from Hadoop. Learn how to work with tables, structures, aggregations, clauses, functions, and more.
Overview
Syllabus
Introduction
- Welcome
- What you should know before watching this course
- Using the exercise files
- Why use Hive
- How Hive works
- Setting up our demo environment
- Understanding table structures in Hive
- Creating tables in Hive
- Handling CSV files in Hive
- Partitioning tables
- Simple SELECT statement
- Retrieving data from complex structures
- Simple aggregations
- Enhanced aggregations with grouping sets
- Using CUBE and ROLLUP
- Simple filter with the WHERE clause
- Filtering aggregates with HAVING clause
- Finding similar values with LIKE
- Combining tables with JOIN
- When to use SEMI JOIN
- Joining multiple tables together
- Types of data manipulation functions
- String functions
- Math functions
- Date functions
- Conditional functions
- Next steps
Taught by
Ben Sullins