This lab shows you how to create a Google Cloud Dataproc cluster, run a simple Apache Spark job in the cluster, then modify the number of workers in the cluster, all from the gcloud Command Line. Watch these short videos, <A HREF="https://youtu.be/h1LvACJWjKc">Dataproc: Qwik Start - Qwiklabs Preview</A> and <A HREF="https://youtu.be/UOX9G6ArJRc">Run Spark and Hadoop Faster with Cloud Dataproc</A>.
Overview
Syllabus
- GSP103
- Overview
- Setup and requirements
- Task 1. Create a cluster
- Task 2. Submit a job
- Task 3. View the job output
- Task 4. Update a cluster
- Task 5. Test your understanding
- Congratulations!