Configuring the Amazon Textract integration
This topic explains how to configure the Amazon Textract integration in Brightspot.
To configure Amazon Textract:
- Obtain the following from your AWS console:
- Name of the SQS queue managing messages between Amazon Textract and Brightspot. For a list of your available queues, see your SQS console.
- ARN of the topic to which Amazon Textract publishes messages. For a list of your available topics, see your SNS console.
- ARN of the role with permissions to make calls to Amazon Textract. For a list of your available AWS roles, see your IAM console.
- Click > Admin > Sites & Settings > Sites > Global.
- Configure the interface with Amazon Textract by doing the following:
- Expand Integrations > AWS Textract.
- Toggle on Enable Textract Service.
- Enter the SQS Queue Name, Topic ARN, and Role ARN you determined in step 1.
- In the Minimum Block Confidence field, enter confidence values for text within each block. Generally, higher confidence levels provide more accurate results (fewer false positives) but may miss some matches (more false negatives).
- Configure the thumbnail generator by doing the following:
- Expand CMS > DAM Document Data Extraction Settings.
- Under Extractor Services, click , and select Textract Document Data Extractor.
- From the Thumbnail Extractor list, select Pdf Document Data Extractor.
- Click Save.
Textract is configured, and editors can view the results of a text extraction in the content edit form.
Previous Topic
Amazon Textract
Next Topic
Viewing an asset's extracted text