Parser attributes can be described by various attribute modifiers. This topic describes all definition modifiers that can be used after the attribute assignment. Valid attribute modifiers are:
- Gender
- Category
- Function
- Recode
Gender (CDP only)
The Gender (GEN=) keyword assigns a gender to a name component. It applies only to definitions for name lines and is required if the attribute used is GVN-NM1 or 2. Valid gender codes are:
- M = Male
- F = Female
- N = Neuter (gender unknown)
Category
The category (CAT=) keyword is a user-defined, free-form means of categorizing data elements. Categories should be limited to six characters (based on assigning multiple categories throughout a record) with a maximum of 50 bytes per record for all categories. A category can be any value that may prove useful as a group during Parsing of name and address components or free-form business data elements.
In the BDP, the category keyword is defined in the word definition table as a classification. Classifications are added to attributes to clarify or refine meaning.
Examples
- In the CDP, assigning SIC codes to company names allows the distribution of customer business verticals to be analyzed after the parsing process is complete: 'BOY SCOUTS' NAME DEF ATT=BUS,CAT=S8641
- In the BDP, you could classify the different brands of clothing as sport, casual, and dress: 'HELLY HANSEN' INSERT MISC DEF ATT=BRAND CAT='CASUAL'
Function
The function (FUNC=) keyword is used when special functions should be performed on the entry. This keyword specifies a certain subroutine, and the functions of that subroutine act on the entry. See Function Attribute Modifiers.
‘BOY SCOUTS’ NAME DEF ATT=BUS,FUNC=’BES01’
Recode
The recode (REC= or RECODE=) keyword is used to recode the value. The value assigned after recode is the value the Parser will assign to the recode output field when the defined word is encountered on input.
Examples
- In the CDP, the parser recodes the word ‘ROAD’ to ‘RD’ as follows: 'ROAD'
STREET END ATT+STR-TYPE,REC=RD
Where,
- 'ROAD' is the value stored in the original data field on Parser output.
- RD is the value stored in the recoded Parser output attribute.
- In the
BDP, recodes can be added to word and phrase definitions after the definitions have
been added to the Customized Definition table. For example, you can recode the
company name of Helly Henson to HH:
'HELLY HANSEN' INSERT MISC DEF ATT=BRAND RECODE='HH'
Where,
- 'HELLY HANSEN' is the value stored in the original data field on Parser output.
- HH is the value stored in the recoded Parser output attribute.
Recode for Masks
Masks may be used to introduce and/or exclude literals and special characters in their recodes.
Examples
- In the CDP, for the mask recode of the nine-digit postcode from '01821-3961' to
'018213961', enter:
'nnnnn-nnnn' MASK GEOG END ATT=POSTCODE',REC='nnnnnnnnn'
- In the BDP, the mask recode of product code TT1234 to TT-1234 would look
like:
'aannnnn' INSERT MASK MISC DEF ATT=PCODE RECODE='aa-nnnnn'