Data Profiling Patterns - trillium_discovery - 17.1

Trillium Administration Center

Product type
Software
Portfolio
Verify
Product family
Trillium
Product
Trillium > Trillium Discovery
Version
17.1
Language
English
Product name
Trillium Discovery
Title
Trillium Administration Center
Topic type
Reference
Overview
Configuration
Installation
How Do I
First publish date
2008

The Discovery Center uses patterns to describe the character shape of a data value. Each pattern represents the data as a series of codes. In the Discovery Center, users reference patterns to quickly identify deviations from the norm when analyzing data.

When you add a new repository, one of the parameters you define is the data profiling pattern. There are six profiling patterns available:

  • Default
  • Rich
  • Long
  • Greek
  • Hebrew
  • Turkish

How to Read Patterns

In the default pattern encoding, a means alpha, d means digit, p means punctuation, and so on. Wherever duplicate character codes occur in sequence (such as aaaa for four alphabetic characters in a row), a number is used to indicate how many alphabetic letters occur. In this case, the "aaaa" pattern is represented as "a4". For example, for the default pattern, the data value "Jane Smith" is represented as "a4_a5".

Click the following links to see more about pattern codes and examples.