Amazon Elastic Compute Cloud (Amazon EC2) provides scalable computing capacity for your workloads in the cloud. Monitoring your infrastructure through the lens of observability helps you verify that your Amazon EC2 instances continue to perform well. Identifying and troubleshooting issues quickly limits their impact on your business and keeps operations running. This course introduces observability and monitoring techniques and explains why they matter to the health of your EC2 instances. You will also work with tools and services that help you support your instances and troubleshoot common issues.
- Course level: Intermediate
- Duration: 120 minutes
Activities
This course includes interactive lessons and knowledge check questions.
Course objectives
In this course, you will learn to do the following:
- Define observability and its role in the health of your EC2 instances.
- Identify the tools and services that support observability and troubleshooting of your EC2 instances.
- Describe the methodology for troubleshooting Amazon EC2 issues.
- Apply the troubleshooting methodology to common Amazon EC2 issues.
Intended audience
This course is intended for the following roles:
- Cloud architects
- Developers
- DevOps professionals
- Administrators
Prerequisites
We recommend that attendees of this course have the following experience and certification, or have completed the following courses:
- Experience working with the Linux operating system
- AWS Cloud Practitioner Certification
- AWS Compute Services Overview course
- Amazon EC2 Basics course
- Build with Amazon Elastic Compute Cloud (Amazon EC2) course
- Rightsize Your Amazon EC2 Workload course
Course outline
Section 1: Course Introduction and Scenario
- Introduction
- Course Scenario
Section 2: Observability and Monitoring
- Defining Observability
- Observability Strategies
- Gaining Visibility Using Data Type
Section 3: Tools and Resources
- Introduction to Observability Tools
- Amazon EC2 Console: Observability Features
- Amazon CloudWatch: Terminology and Usage
- AWS Trusted Advisor: Inspecting Your Environment
- Amazon EventBridge: Instance Monitoring
- AWS CloudTrail: Monitoring Access
Section 4: Troubleshooting and Incident Response
- Incident Response Process
- Introducing the Troubleshooting Scenarios
Section 5: Performance and Latency Troubleshooting Scenario
- Performance and Latency Scenario: Identify and Collect
- Performance and Latency Scenario: Analyze, Develop, Test, and Implement
- Performance and Latency Scenario: Perform Post-Incident Review and Document the Solution
Section 6: Connectivity Troubleshooting Scenario
- Connectivity Scenario: Identify and Collect
- Connectivity Scenario: Analyze, Develop, Test, and Implement
- Connectivity Scenario: Perform Post-Incident Review and Document the Solution
Section 7: Course Scenario Summary
- Course Summary
- Knowledge Check
Section 8: Additional Resources
- Course Feedback
- Resources