Attribute-based Recode Tables - trillium_discovery - trillium_quality - Latest

Trillium Control Center

Product type
Software
Portfolio
Verify
Product family
Trillium
Product
Trillium > Trillium Discovery
Version
Latest
Language
English
Product name
Trillium Quality and Discovery
Title
Trillium Control Center
Copyright
2024
First publish date
2008
Last updated
2024-10-18
Published on
2024-10-18T15:02:04.502478

If you have specified a set of value or mask recodes during data profiling, and you want to use the same set of recodes in a data quality project, use attribute-based recode tables. This feature allows you to use a recode table from your profiling results.

Note: The maximum attribute length for table recodes is 2000 bytes (1000 characters in UCS2).

During data profiling, you found that the data has a variety of formats for the Product Type attribute. To standardize the format, you created the following mask recodes in data profiling.

Attribute = Product Type

Value

Mask Recoding Mask Mask Recode
MA8 4EE AAN NAA AA0N NAA MA08 4EE
MA9 2DD AAN NAA AA0N NAA MA09 2DD
MA4 2NN AAN NAA AA0N NAA MA04 2NN

Now you want to use this recode to generate a recode table and ddx and apply them to the Transformer.

To generate an attribute-based recode table

  1. Provide recodes for specific values or masks for individual attributes in data profiling.
  2. From the Navigation or Project View, right-click the Transformer process and select Edit Process. You can also double-click the process to open it for editing.
  3. Select the Output Conditionals tab.
  4. Right-click within the Transformation Rules pane (right-hand side) and select Add > New > Table Recode.
  5. Enter a description for this table recode operation. The description will be inserted in the Statistics File.
  6. Select Attribute as the source of recodes.
  7. Select an entity and one of the entity's attribute which has recoded values. After the selection, a preview button (Open values table or Open mask table) will become enabled.
  8. Click Open values table or Open mask table to display the attribute in a list view.
  9. Review the recodes and the effect they have on the attribute's values. When you are satisfied with the recode, click Next.
  10. Select the match attribute. This is the Transformer output attribute which will be matched against the original value in the attribute specified in Step 7.
  11. Select the destination attribute. This is the Transformer output attribute which will store the recoded value.
  12. Select the recode type (literal or mask).
  13. (Optional) Select Case sensitive matching to use case-sensitive matching. By default, matching is case-insensitive. The case sensitive option does not apply to masks, only to values.
  14. Click Finish. The scheduler notification bar opens.
  15. (Optional) Change the job name.
  16. Do one of the following:
    • To schedule the job to run immediately, click Now. The message closes and the job begins running in the background.
    • Click Later. The Set Date & Time scheduler window opens. To immediately run the job in the background, click Run Now. To schedule a time, select a starting date on the calendar and select a time you want the job to run on the selected date. Click Submit to save the scheduled time.
    • Click Cancel to cancel the task.
    Note: All scheduled jobs are run in the background. Note the progress in the Background Tasks List View.

    The recode table and its ddx file will be created in path\repository_name\projectN\tables, where path is the path to the repository directory (specified during installation), repository_name is the name of your repository and N is the number of your project. For example, C:/ProgramData/Trillium Software/.../repository1/project1/tables.

    When the Transformer is run, it will apply the recodes to the output attributes.