This lab introduces you to workflows designed to showcase the use of the Amazon Textract Intelligent Document Processing (IDP) using Amazon Cloud Development Kit (CDK) constructs. IDP allows for more accurate and detailed text recognition from images by using pre-processing techniques such as filtering, noise reduction, and other image manipulation techniques to improve the quality of the images before they are processed. Amazon Textract is a cloud-based service that automatically extracts text and data from scanned documents. The Amazon Textract IDP CDK Constructs are a set of pre-defined code patterns for using Amazon Textract to identify and redact sensitive information in documents.
Level
Fundamental
Duration
1 Hours 0 Minutes
Course Objectives
In this course, you will learn how to:
By the end of this lab, you are be able to do the following:
- Assess the CDK Textract constructs
- Analyze an image using the Textract SimpleSyncWorkflow
- Analyze a document using the Textract SimpleAsyncWorkflow
Intended Audience
This course is intended for:
- Developers & Data Engineers
Prerequisites
We recommend that attendees of this course have the following prerequisites:
To successfully complete this lab, it is recommended to be familiar with the following:
- Using a Linux terminal
- Basic navigation of the AWS Management Console
- Uploading files to an Amazon Simple Storage Service (Amazon S3) bucket using the AWS ClI.
Course Outline
- Task 1: Connect to the AWS Cloud9 environment and review the CDK project
- Task 2: Analyze an image in Textract using the SimpleSyncWorkflow
- Task 3: Analyze a document in Textract using the SimpleAsyncWorkflow