Brightspot Integrations Guide

Amazon Textract


With Amazon Textract, you can extract text from assets of the content types Document, Spreadsheet, Presentation, and Attachment. Brightspot associates the extracted text with the asset, so editors can search for the asset and use it in their own content.

[Figure: Text extraction from PDF in Brightspot]
Note
The Amazon Textract integration is currently not available for image files you add to Brightspot.

This section describes how to configure the Amazon Textract integration in Brightspot, and how to view extracted text.

Including Amazon Textract in a Brightspot build

The following table lists the dependencies to include in your build configuration.

Artifact | Description
com.psddev:aws-textract | Exposes Textract-related controls in Sites & Settings, as well as the UI and processing to submit and display results of Textract jobs.
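
As an illustration, the dependency above might be declared as follows in a Gradle build that uses the Kotlin DSL. This is a minimal sketch, not a confirmed Brightspot configuration: it assumes the project builds with Gradle, that the `implementation` configuration is appropriate for your module, and that the artifact's version is supplied by a platform or BOM already applied to the project. If no such version management is in place, append the explicit version that matches your Brightspot release.

```kotlin
// build.gradle.kts (sketch; the configuration name and version management are assumptions)
dependencies {
    // Artifact coordinates from the table above. No version is given here on the
    // assumption that a platform/BOM in this project manages it.
    implementation("com.psddev:aws-textract")
}
```

Projects built with Maven would declare the same group and artifact IDs (`com.psddev` and `aws-textract`) in a `<dependency>` element instead.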

Runtime prerequisites
