Attribute Modifiers - trillium_discovery - trillium_quality - 17.1

Trillium Parser Tuner

Product type
Software
Portfolio
Verify
Product family
Trillium
Product
Trillium > Trillium Quality
Trillium > Trillium Discovery
Version
17.1
Language
English
Product name
Trillium Quality and Discovery
Title
Trillium Parser Tuner
Topic type
How Do I
Overview
Configuration
Reference
Administration
Installation
First publish date
2008

Parser attributes can be described by various attribute modifiers. This topic describes all definition modifiers that can be used after the attribute assignment. Valid attribute modifiers are:

  • Gender
  • Category
  • Function
  • Recode

Gender (CDP only)

The Gender (GEN=) keyword assigns a gender to a name component. It applies only to definitions for name lines and is required if the attribute used is GVN-NM1 or 2. Valid gender codes are:

  • M = Male
  • F = Female
  • N = Neuter (gender unknown)

Category

The category (CAT=) keyword is a user-defined, free-form means of categorizing data elements. Categories should be limited to six characters (based on assigning multiple categories throughout a record) with a maximum of 50 bytes per record for all categories. A category can be any value that may prove useful as a group during Parsing of name and address components or free-form business data elements.

In the BDP, the category keyword is defined in the word definition table as a classification. Classifications are added to attributes to clarify or refine meaning.

Examples

  • In the CDP, assigning SIC codes to company names allows the distribution of customer business verticals to be analyzed after the parsing process is complete: 'BOY SCOUTS' NAME DEF ATT=BUS,CAT=S8641
  • In the BDP, you could classify the different brands of clothing as sport, casual, and dress: 'HELLY HANSEN' INSERT MISC DEF ATT=BRAND CAT='CASUAL'

Function

The function (FUNC=) keyword is used when special functions should be performed on the entry. This keyword specifies a certain subroutine, and the functions of that subroutine act on the entry. See Function Attribute Modifiers.

‘BOY SCOUTS’ NAME DEF ATT=BUS,FUNC=’BES01’

Recode

The recode (REC= or RECODE=) keyword is used to recode the value. The value assigned after recode is the value the Parser will assign to the recode output field when the defined word is encountered on input.

Note: The keyword is written as REC= in the CDP and RECODE= in the BDP.

Examples

  • In the CDP, the parser recodes the word ‘ROAD’ to ‘RD’ as follows: 'ROAD' STREET END ATT+STR-TYPE,REC=RD

    Where,

    • 'ROAD' is the value stored in the original data field on Parser output.
    • RD is the value stored in the recoded Parser output attribute.
  • In the BDP, recodes can be added to word and phrase definitions after the definitions have been added to the Customized Definition table. For example, you can recode the company name of Helly Henson to HH:

    'HELLY HANSEN' INSERT MISC DEF ATT=BRAND RECODE='HH'

    Where,

    • 'HELLY HANSEN' is the value stored in the original data field on Parser output.
    • HH is the value stored in the recoded Parser output attribute.

Recode for Masks

Masks may be used to introduce and/or exclude literals and special characters in their recodes.

Examples

  • In the CDP, for the mask recode of the nine-digit postcode from '01821-3961' to '018213961', enter:

    'nnnnn-nnnn' MASK GEOG END ATT=POSTCODE',REC='nnnnnnnnn'

  • In the BDP, the mask recode of product code TT1234 to TT-1234 would look like:

    'aannnnn' INSERT MASK MISC DEF ATT=PCODE RECODE='aa-nnnnn'