Detect Entities in Medical Text - Data360_DQ+ - Latest

Data360 DQ+ Help

Product type
Software
Portfolio
Verify
Product family
Data360
Product
Data360 DQ+
Version
Latest
Language
English
Product name
Data360 DQ+
Title
Data360 DQ+ Help
Copyright
2024
First publish date
2016
ft:lastEdition
2024-07-09
ft:lastPublication
2024-07-09T15:09:58.774265
Note: This node is only available with the Cloud edition of the product.

The Detect Entities in Medical Text node allows you to inspect an input string field to identify words that are medical entities. Medical entities can include Protected Health Information (PHI) and medical conditions, for example. The node uses the Amazon Comprehend API to identify medical entities in incoming text fields.

Detect Entities in Medical Text node

The node outputs all input fields, and one new array field called 'DetectedEntities'. You can click a row in the DetectedEntities column to show the following information:

  • BeginOffset - The position of the first character of the entity in the string.
  • EndOffset - The position of the last character of the entity in the string.
  • Score - The estimated accuracy of the analysis.
  • Text - The detected text entity.
  • Type - The entity type, for example 'AGE' or 'NAME'.
  • Category - The entity category, for example 'PROTECTED_HEALTH_INFORMATION'.
  • Id - A record ID.
  • Response - The complete API response in JSON format. This response can be processed further for traits and attribute information.

Example

Input text: The patient is a 40 year old teacher.

BeginOffset EndOffset Score Text Type Category Id Response
18 19 0.9981877 40 AGE PROTECTED_HEALTH_INFORMATION 0 { "Id":0, "beginOffset":18, "endOffset": 19, "score": 0.9981877, "text": "40", "traits":[],"attributes":[], "categoryAsString": "PROTECTED_HEALTH_INFORMATION", "typeAsString": "AGE"}
30 36 0.53624517 teacher PROFESSION PROTECTED_HEALTH_INFORMATION 1 { "Id":0, "beginOffset":30, "endOffset": 36, "score": 0.53624517, "text": "teacher", "traits":[],"attributes":[], "categoryAsString": "PROTECTED_HEALTH_INFORMATION", "typeAsString": "PROFESSION"}

 

For more information, please see the Amazon Comprehend documentation, for example https://docs.aws.amazon.com/comprehend/latest/dg/how-medical-phi.html

Properties

Display Name

Specify the name of the node that is displayed on the Analysis Designer canvas.

The default value is Detect Entities in Medical Text.

Detect Medical Entities in

Select an input field to analyze.

Detect PHI Only

Select this option to return only entities that are categorized as PHI (Protected Health Information).

Enterprise properties

If you are using the Enterprise edition, you will also need to enter AWS credentials in the following properties:

  • Access Key
  • Secret