Overview
Explore the comprehensive data stack in this 35-minute lecture on data management. Dive into various sources of data, including filesystems, object stores, databases, and data warehouses, while understanding latency numbers. Learn techniques for exploring and processing data, and discover the concept of feature stores. Gain insights into best practices and sample datasets for practical application. Delve into self-supervised learning and data labeling methods to enhance your data management skills. Conclude with an overview of data versioning techniques. Access detailed notes and slides for further study, and subscribe to follow along with the full 2022 course.
Syllabus
Key points
Sources of data: filesystems, latency numbers, object stores, databases, data warehouses
Exploring data
Processing data
Feature stores
Summary of best practices and some sample datasets
Self-supervised learning and data labeling
Data versioning
Taught by
The Full Stack