The Detect Entities node identifies text entities in an input string field. An entity is a textual reference to the unique name of a real-world object, such as a person, place, date or quantity. The node uses the Amazon Comprehend API to perform the detection.
The node outputs all input fields plus one new array field called 'DetectedEntities'. Click a row in the DetectedEntities column to show the following information for each detected entity:
- BeginOffset - The position of the first character of the entity in the string.
- EndOffset - The position of the last character of the entity in the string.
- Score - The confidence, between 0 and 1, that the entity was detected correctly.
- Text - The detected text entity.
- Type - The entity type. The following table lists the entity types which are recognized by the node:
Type | Description |
---|---|
COMMERCIAL_ITEM | A branded product. |
DATE | A date (e.g. 12/22/2018), a day (e.g. Thursday), a month (e.g. May) or a time (e.g. 9:15 a.m.). |
EVENT | An event, such as a concert or an election. |
LOCATION | A location such as a country, a city, a river or a building. |
ORGANIZATION | An organization such as a company, a sports team or a religion. |
OTHER | Entities that do not fit in any of the other categories. |
PERSON | Individuals, groups of people, nicknames, fictional characters. |
QUANTITY | A quantified amount, such as currency, percentages, numbers or bytes. |
TITLE | An official name given to any creation or creative work, such as movies, books, or songs. |
For more information, see the Amazon Comprehend documentation, for example https://docs.aws.amazon.com/comprehend/latest/dg/how-entities.html
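How the node calls the service internally is not described here, but because it relies on Amazon Comprehend, a roughly equivalent direct call can be sketched in Python with boto3. This is a minimal sketch under stated assumptions: the helper names `detect_entities` and `run_against_aws` and the `us-east-1` region are illustrative only, and offsets returned directly by the API may not use the same numbering as the node's output.

```python
def detect_entities(client, text, language_code="en"):
    """Return the entities Amazon Comprehend finds in `text`.

    `client` is any object exposing Comprehend's `detect_entities`
    operation, for example a boto3 Comprehend client.
    """
    response = client.detect_entities(Text=text, LanguageCode=language_code)
    # Keep only the five attributes described in the list above.
    keys = ("BeginOffset", "EndOffset", "Score", "Text", "Type")
    return [{key: entity[key] for key in keys} for entity in response["Entities"]]


def run_against_aws(text, region_name="us-east-1"):
    """Hypothetical convenience wrapper; needs boto3 and AWS credentials."""
    import boto3  # imported lazily so detect_entities works without boto3

    client = boto3.client("comprehend", region_name=region_name)
    return detect_entities(client, text)
```

With AWS credentials configured, `run_against_aws("The customer is John Smith.")` would return a list of dictionaries, one per detected entity.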
Example
Input text: The customer is John Smith, a 50 year old who lives in New York.
In this example, the following entities can be identified:
BeginOffset | EndOffset | Score | Text | Type |
---|---|---|---|---|
17 | 26 | 0.9914761 | John Smith | PERSON |
31 | 32 | 0.9981457 | 50 | QUANTITY |
56 | 63 | 0.8912875 | New York | LOCATION |
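The offsets in this table can be used to recover each entity from the input text. The values shown are consistent with counting characters from 1 and treating EndOffset as inclusive; that convention is inferred from this example rather than stated as a guarantee, so the Python sketch below checks it explicitly. The names `entity_text` and `check_rows` are hypothetical helpers.

```python
TEXT = "The customer is John Smith, a 50 year old who lives in New York."

# Rows copied from the example table: (BeginOffset, EndOffset, Text, Type).
ROWS = [
    (17, 26, "John Smith", "PERSON"),
    (31, 32, "50", "QUANTITY"),
    (56, 63, "New York", "LOCATION"),
]


def entity_text(text, begin_offset, end_offset):
    # Assumption: offsets count from 1 and EndOffset is inclusive,
    # so shift the start by one for Python's zero-based slicing.
    return text[begin_offset - 1:end_offset]


def check_rows(text, rows):
    # True for each row whose offsets reproduce its Text value.
    return [entity_text(text, begin, end) == expected
            for begin, end, expected, _type in rows]
```

If `check_rows(TEXT, ROWS)` reports a mismatch for your own data, the offset convention assumed here does not apply and the slice bounds need adjusting.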
Properties
Display Name
Specify the name of the node that is displayed on the Analysis Designer canvas.
The default value is Detect Entities.
Detect Entities
Select an input string field to analyze.
Language
Select the language of the text to analyze.
The default value is English.