Only after generating window keys and sorting rows by window keys, you can configure and run the Relationship Linker process. The Relationship Linker uses various linking algorithms called comparison routines to compare data. Matches are determined based on the attributes, comparison routine, score, grade, and pattern; these are collectively called "match rules." Based on the grade and pattern returned by the routines, the result of the comparisons is considered a match (P), a suspect match (S), or an unmatched (F). See Relationship Linker.
The Relationship Linker performs two types of linking:
consumer and business. The consumer type linking is performed at the household and individual levels, whereas the business type linking is performed at the company and contact levels. The default rules are given for each country project template. View the default rules and change if necessary before running the Relationship Linker.
To view the rules and run the Relationship Linker
-
From the Navigation or Project View, right-click the Relationship Linker process and select Edit Process.
-
Select the Process tab.
-
Confirm that Attribute containing record type is set to PR_NAME_FORM_01. This attribute contains the Consumer/Business flag and is used to load records in either the Consumer or the Business linking window. Valid values are 1 = Consumer, 2 = Business, 3 = Error (unknown), Blank = No name present or identified.
-
Confirm that Attribute containing window key is set to WINDOW_KEY_01.
-
Select the Tools tab.
-
Click Launch Rules Editor. The Relationship Linker Rules Editor opens. Notice that there are four tabs at the top showing the rules files names. Those files contain the match rules for business and consumer linking. For example, the default files for the US project are as follows:
- p8_usbus1fld.stx - Match rules for business at company level.
- p8_usbus2fld.stx - Match rules for business at contact level.
- p8_uscon1fld.stx - Match rules for consumer at household level.
- p8_uscon2fld.stx - Match rules for consumer at individual level.
-
Click one of the tabs and select Window > Tile Horizontally. The Grade Pattern Editor and Field List Editor open and display the match rules.
-
First, review the rules in the Field List Editor. For example, p8_usbus1fld.stx looks like this:
Field List Editor
-
Field Name 1. Attributes to compare. The typical attributes for linking are business and personal names, primary and secondary address and geography components.
-
Comparison Routine. Matching algorithm. Trillium provides optimal algorithms tailored to the type of attributes. For example, the BUSNAME routine is specifically designed to compare business name attributes.
-
Score. The numeric value from 0 to 100. Each comparison routine employs a unique score system to measure the similarity of the data. For example, the ABSOLUTE routine returns a score of 100 for exact match and a 0 for all other cases.
In this example, there are six attributes to compare: business name, street name, house number, PO Box number, postal code, and city name. You use the BUSNAME comparison routine for business name and set up the score threshold of 100, 90, and 80. This means that if the records are exact match for business name, it receives a score of 100 (an A). If the number of word errors is more than one third the greater number of words in both attributes, it receives a score of 90 (a B). The STREETS, HOUSENO, ABSOLUTE, and PARTIAL1 routines are used for other attributes.
-
Next, review the rules in the Grade Pattern List. For example, p8_usbus1fld.stx looks like this:
Grade Pattern List
-
Category. The type of match represented by the pattern of grades. The values are P (Pass - matched), S (Suspect - suspect match), and F (Fail - unmatched).
-
Pattern ID. A unique, user-defined identification number for the combination of matching attributes and grades. Patterns define the criteria that determine whether the match is considered a pass, suspect, or fail.
-
Grade. The letter grade from A to E that corresponds to the numeric score. You can specify up to five grade thresholds. For example, a score 100 is grade A, a score 95 is grade B, and so on. A hyphen (-) is a wildcard.
In this example, the first two grade patterns are set as follows:
- F990 - - - - BB. If the record received a score of B on postal code and city, it will be a Fail (F) with Pattern ID 990.
- P100 AAAA - - . If the record received a score of 100 (an A) on business name, street name, house number, and PO Box, it will be a Pass (P) with Pattern ID 100.
-
Repeat Step 7 through 9 for each rules file and review the match rules. If you need to change any of the rules for tuning, do it here and select File > Save.
-
Close the Relationship Linker Editor.
-
Click Finish if you changed any of the match rules to save the changes. Otherwise click Cancel.
-
Run the Relationship Linker process.
The matching is performed. Now you can move on to Step 4, Examine Match Results.