Develop the skills required to architect and manage batch processing applications to generate consistent data-driven results.
Overview
Syllabus
Introduction
- Architecting big data applications
- Characteristics of batch processing
- Challenges building batch applications
- Technologies for batch big data engineering
- Use cases for batch big data
- Architecture process for data engineering
- Making the choice: Real-time vs. batch
- Horizontal scaling
- Distributed processing
- Technology selection
- Technology integrations
- Schedule selection
- Minimizing data volumes
- Uniform load distribution
- Using caches
- Reprocessing
- Audit trail: Define the problem
- Audit trail: Study requirements
- Audit trail: Create a workflow
- Audit trail: Scale the workflow
- Audit trail: Select technologies
- Audit trail: Review final architecture
- Advertising analytics: Define the problem
- Advertising analytics: Study requirements
- Advertising analytics: Create a workflow
- Advertising analytics: Scale the workflow
- Advertising analytics: Select technologies
- Advertising analytics: Review final architecture
- Product recommendations: Define the problem
- Product recommendations: Study requirements
- Product recommendations: Create a workflow
- Product recommendations: Scale the workflow
- Product recommendations: Select technologies
- Product recommendations: Review the final architecture
- Continuing to architect big data applications
Taught by
Kumaran Ponnambalam