Detect Entities - Data360_DQ+ - Latest

Data360 DQ+ Help

Product type: Software
Portfolio: Verify
Product family: Data360
Product: Data360 DQ+
Version: Latest
Language: English
Copyright: 2024
First publish date: 2016
Last edition: 2024-07-09
Last publication: 2024-07-09T15:09:58.774265
Note: This node is only available with the Cloud edition of the product.

The Detect Entities node identifies entities in an input string field. An entity is a textual reference to the unique name of a real-world object, such as a person or a place, or a precise reference to a measure, such as a date or a quantity. The node uses the Amazon Comprehend API to detect entities in the incoming text field.


The node outputs all input fields, and one new array field called 'DetectedEntities'. You can click a row in the DetectedEntities column to show the following information:

  • BeginOffset - The position of the first character of the detected entity in the input string.
  • EndOffset - The position of the last character of the detected entity in the input string.
  • Score - The estimated confidence, between 0 and 1, that the entity was detected correctly.
  • Text - The detected text entity.
  • Type - The entity type. The following table lists the entity types which are recognized by the node:
Type             Description
COMMERCIAL_ITEM  A branded product.
DATE             A date (e.g. 12/22/2018), a day (e.g. Thursday), a month (e.g. May) or a time (e.g. 9:15 a.m.).
EVENT            An event, for example a concert or an election.
LOCATION         A location, such as a country, a city, a river or a building.
ORGANIZATION     An organization, such as a company, a sports team or a religion.
OTHER            Entities that do not fit in any of the other categories.
PERSON           Individuals, groups of people, nicknames or fictional characters.
QUANTITY         A quantified amount, such as currency, percentages, numbers or bytes.
TITLE            An official name given to any creation or creative work, such as a movie, a book or a song.

For more information, see the Amazon Comprehend documentation, for example: https://docs.aws.amazon.com/comprehend/latest/dg/how-entities.html
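
As a rough illustration of the kind of request the node relies on, the following sketch calls the Amazon Comprehend DetectEntities operation directly through boto3. It assumes that boto3 is installed and that AWS credentials are configured. The helper names make_client and detect_entities and the default region are hypothetical and are not part of Data360 DQ+. How the node maps the API response to the DetectedEntities field is not described here.

```python
from typing import Any, Dict, List


def make_client(region_name: str = "us-east-1") -> Any:
    """Create an Amazon Comprehend client (needs boto3 and AWS credentials)."""
    import boto3  # imported lazily so the rest of the sketch runs without boto3

    # The region is an assumption; use the region of your own AWS account.
    return boto3.client("comprehend", region_name=region_name)


def detect_entities(client: Any, text: str, language_code: str = "en") -> List[Dict[str, Any]]:
    """Return the entities that Amazon Comprehend reports for `text`."""
    response = client.detect_entities(Text=text, LanguageCode=language_code)
    # Each entity is a dict with BeginOffset, EndOffset, Score, Text and Type keys.
    return response["Entities"]
```

With valid credentials, detect_entities(make_client(), "The customer is John Smith.") would return a list of entity dictionaries. Passing the client as a parameter keeps the function easy to exercise with a stub object when no AWS account is available.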

Example

Input text: The customer is John Smith, a 50 year old who lives in New York.

In this example, the following entities can be identified:

BeginOffset  EndOffset  Score      Text        Type
17           26         0.9914761  John Smith  PERSON
31           32         0.9981457  50          QUANTITY
56           63         0.8912875  New York    LOCATION
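
The offsets in this example can be checked against the input text. The sketch below assumes, based on the values in the table above, that BeginOffset and EndOffset are counted from 1 and that EndOffset is inclusive. The function name entity_text is hypothetical and only serves the illustration.

```python
def entity_text(text: str, begin_offset: int, end_offset: int) -> str:
    """Return the substring covered by 1-based, inclusive offsets."""
    # Python slices are 0-based with an exclusive end, so only the start shifts by one.
    return text[begin_offset - 1:end_offset]


input_text = "The customer is John Smith, a 50 year old who lives in New York."

# (BeginOffset, EndOffset, Type) values copied from the example table.
rows = [(17, 26, "PERSON"), (31, 32, "QUANTITY"), (56, 63, "LOCATION")]

for begin, end, entity_type in rows:
    print(entity_type, repr(entity_text(input_text, begin, end)))
# PERSON 'John Smith'
# QUANTITY '50'
# LOCATION 'New York'
```

Each slice reproduces the value in the Text column, which is a quick way to confirm how the offsets in a DetectedEntities row should be read.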

Properties

Display Name

Specify the name of the node that is displayed on the Analysis Designer canvas.

The default value is Detect Entities.

Detect Entities

Select an input string field to analyze.

Language

Select the language of the text to analyze.

The default value is English.