Entity-based Recode Tables - trillium_discovery - trillium_quality - Latest

Trillium Control Center

Product type
Software
Portfolio
Verify
Product family
Trillium
Product
Trillium > Trillium Discovery
Version
Latest
Language
English
Product name
Trillium Quality and Discovery
Title
Trillium Control Center
Copyright
2024
First publish date
2008
Last updated
2024-10-18
Published on
2024-10-18T15:02:04.502478

Entity-based recode tables allow the user to load an existing entity (real or dynamic) in the repository and use its attributes for recoding. This method can also be used when you have a quality project that has already been run to generate the recode data, and you want to use the same recode values or masks in a subsequent project.

Note: The maximum attribute length for table recodes is 2000 bytes (1000 characters in UCS2).

You are building two data quality projects using similar input data so that you can work independently on the two processes. After creating and executing the table recode for the first project, you want to utilize the first project's output as the source for the second project's table recode. The output of Project 1 is as follows:

Project 1, Output Entity = us_tranfrmr_p1, Attribute = Product_Type

Product_Type Product _Type_Recoded
MA8 4EE MA08 4EE
MA9 2DD MA09 2DD
MS32 1JJ MA32 1JJ
MA4 2NN MA04 2NN

In Project 2, you see the same values such as "MA8 4EE" in one of the input attributes. You can use the output entity of Project 1, us_tranfrmr_p1 as the source of the recodes and match the attributes in Project 2 against the attributes in Project 1.

To generate an entity-based recode table

  1. From the Navigation or Project View, right-click the Transformer process and select Edit Process. You can also double-click the process to open it for editing.
  2. Select the Output Conditionals tab.
  3. Right-click within the Transformation Rules pane (right-hand side) and select Add > New > Table Recode.
  4. Enter a description for this table recode operation. The description will be inserted in the Statistics File.
  5. Select Entity as the source of recodes.
  6. Select the entity containing the recodes that you want to use.

    If you select a dynamic delimited entity, you must define the maximum lengths for the attributes in the entity. Click Adjust attribute length and specify maximum width (in characters) for each attribute and click OK. The Adjust attribute length feature also has a Set All option.

  7. Click Show Data to display the entity in a list view.
  8. Review the data and the effect they have on the attribute's values. When you are satisfied with the recode, click Next.
  9. Select the attribute from the Match attribute list. This is the Transformer output attribute which will be matched against the original value in the recode entity specified in Step 6.
    Note: You can match up to five (5) attributes. Specifying multiple attributes creates an AND condition for the table recode operation. See Recoding Data Using Multiple Attributes.
  10. Select the attribute from the to recode attribute list. This is the attribute in the recode entity where the original values are described.
  11. Select the recode type (literal or mask). Click Next.
  12. Select the attribute from the Copy recode attribute list. This is the attribute in the recode entity that contains the new values.
  13. Select the destination attribute from the to attribute list. This is the Transformer output attribute which will store the recoded value.
  14. Select the recode type (literal or mask).
  15. (Optional) Select Case sensitive matching to use case-sensitive matching. By default, matching is case-insensitive. The case sensitive option does not apply to masks, only to values.
  16. Click Finish. The scheduler notification bar opens.
  17. (Optional) Change the job name.
  18. Do one of the following:
    • To schedule the job to run immediately, click Now. The message closes and the job begins running in the background.
    • Click Later. The Set Date & Time scheduler window opens. To immediately run the job in the background, click Run Now. To schedule a time, select a starting date on the calendar and select a time you want the job to run on the selected date. Click Submit to save the scheduled time.
    • Click Cancel to cancel the task.
    Note: All scheduled jobs are run in the background. Note the progress in the Background Tasks List View.

    The recode table and its ddx file will be created in path\repository_name\projectN\tables, where path is the path to the repository directory (specified during installation), repository_name is the name of your repository and N is the number of your project. For example, C:/ProgramData/Trillium Software/.../repository1/project1/tables.

    When the Transformer is run, it will apply the recodes to the output attributes.