What you'll learn:
- Set up a cluster, keyspaces, column families and manage them
- Run queries using the CQL command shell
- Design primary keys and secondary indexes with partitioning and clustering considerations
- Use the Cassandra Java driver to connect and run queries on the cluster
Taught by a team which includes 2 Stanford-educated ex-Googlers and 2 ex-Flipkart Lead Analysts. This team has decades of practical experience in working with large-scale data processing.
Has your data gotten huge, unwieldy and hard to manage with a traditional database? Is your data unstructured with an expanding list of attributes? Do you want to ensure your data is always available even with server crashes? Look beyond Hadoop - the Cassandra distributed database is the solution to your problems.
Let's parse that.
- Huge, unwieldy data: This course helps you set up a cluster with multiple nodes to distribute data across machines
- Unstructured: Cassandra is a wide-column store. There are no empty cells or space wasted when you store data with variable and expanding attributes
- Always available: Cassandra uses partitioning and replication to ensure that your data is available even when nodes in a cluster go down
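To make the partitioning and replication idea concrete, here is a minimal toy sketch in Python. It is not Cassandra's actual partitioner or replication strategy: the MD5 hash, the one-token-per-node ring, and the names `token`, `replicas_for`, `node1`..`node4` and `product:42` are all assumptions made for illustration. It only shows why a key stored on several nodes stays readable when one of them goes down.

```python
import hashlib
from bisect import bisect_right


def token(value):
    """Map a string to a position on the ring (MD5 is used purely for illustration)."""
    return int(hashlib.md5(value.encode("utf-8")).hexdigest(), 16)


def replicas_for(key, nodes, replication_factor):
    """Return the nodes that would hold copies of `key` on a simple hash ring."""
    # Place each node on the ring at the position given by its own token.
    ring = sorted((token(n), n) for n in nodes)
    tokens = [t for t, _ in ring]
    # The first replica is the next node clockwise from the key's token.
    start = bisect_right(tokens, token(key)) % len(ring)
    # Further replicas are the following nodes on the ring.
    count = min(replication_factor, len(ring))
    return [ring[(start + i) % len(ring)][1] for i in range(count)]


nodes = ["node1", "node2", "node3", "node4"]
owners = replicas_for("product:42", nodes, replication_factor=3)

# Simulate one replica going down: the other copies can still serve the key.
down = owners[0]
survivors = [n for n in owners if n != down]
print("replicas:", owners)
print("still available on:", survivors)
```

With a replication factor of 3, losing any single node still leaves two copies of every key in this toy model.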
What's included in this course:
- The Cassandra Cluster Manager (CCM) to set up and manage your cluster
- The Cassandra Query Language (CQL) to create keyspaces, column families, perform CRUD operations on column families and other administrative tasks
- Designing primary keys and secondary indexes, partitioning and clustering keys
- Restrictions on queries based on primary and secondary key design
- Tunable consistency using quorum and local quorum. Read and write consistency in a node
- Architecture and Storage components: Commit Log, MemTable, SSTables, Bloom Filters, Index File, Summary File and Data File
- A real-world project: A Miniature Catalog Management System using the Cassandra Java driver
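As a small illustration of the tunable consistency topic listed above, the arithmetic behind quorum can be sketched in a few lines of Python. A quorum is a majority of the replicas, floor(RF / 2) + 1, and a read is guaranteed to see the latest acknowledged write when the number of replicas read plus the number of replicas written exceeds the replication factor. The function names `quorum` and `overlaps` are hypothetical helpers, not part of any Cassandra API.

```python
def quorum(replication_factor):
    """Majority of replicas: floor(RF / 2) + 1."""
    return replication_factor // 2 + 1


def overlaps(read_replicas, write_replicas, replication_factor):
    """True when every read set must share at least one replica with every write set."""
    return read_replicas + write_replicas > replication_factor


rf = 3
q = quorum(rf)

# Quorum reads plus quorum writes always overlap on at least one replica.
print("quorum for RF=3:", q)
print("QUORUM/QUORUM overlaps:", overlaps(q, q, rf))
# Reading and writing a single replica each gives no such guarantee.
print("ONE/ONE overlaps:", overlaps(1, 1, rf))
```

Lower levels trade that guarantee for lower latency and higher availability, which is the sense in which consistency is tunable.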