Overview
In this fourth course of the Python, Bash and SQL Essentials for Data Engineering Specialization, you will build upon the data engineering concepts introduced in the first three courses to apply Python, Bash and SQL techniques in tackling real-world problems. First, we will dive deeper into leveraging Jupyter notebooks to create and deploy models for machine learning tasks. Then, we will explore how to use Python microservices to break up your data warehouse into small, portable solutions that can scale. Finally, you will build a powerful command-line tool to automate testing and quality control for publishing and sharing your tool with a data registry.
Syllabus
- Jupyter Notebooks
- In this module, you will learn how to install and run Jupyter on your local machine. Additionally, you will explore strategies to use code and text cells in a Jupyter notebook.
- Cloud-Hosted Notebooks
- In this module, you will learn how to create and use a Cloud-based notebook in Google Colab and AWS Sagemaker.
- Python Microservices
- In this module, you will learn how to build a Python Microservice with FastAPI and deploy a containerized machine learning Microservice for data engineering.
- Python Packaging and Rust Command Line Tools
- In this module, you will learn how to organize a Python project so you can build a powerful command-line tool. You will use Click, a useful command-line tool framework to enhance your tool. Finally, you will automate testing and quality control for publishing and sharing your tool with a registry.
Taught by
Noah Gift, Kennedy Behrman and Alfredo Deza