There are 17 predefined street parser rules, each of which is described in this topic. The rules are applied in the order listed, which means if you disable a rule, it may have an impact on the rules that follow.
Guidelines
- The six STREET_BEFORE_PATTERN_BUS_WORD_PROC_RULEx rules are applied prior to pattern look up.
- The eleven STREET_LEVELx_RULEy rules are applied when a pattern is not found.
- It is a good idea to familiarize yourself with the terms used in the intrinsic attributes and specific attributes topics before proceeding.
- In this topic, horizontal lines are used to group related rules.
Allow only one COMPLEX-TYPE attribute to remain
(STREET_BEFORE_PATTERN_BUS_WORD_PROC_RULE1)
This rule allows only one COMPLEX-TYPE attribute to remain in a street pattern. Any preceding COMPLEX-TYPE attributes revert back to their intrinsic attribute. For example:
GRAND ESTATE APARTMENTS
ALPHA COMPLEX-TYPE COMPLEX-TYPE
becomes
ALPHA ALPHA COMPLEX-TYPE
Force any HIGHWAY or ROUTE attribute without pairs to STREET-TYPE
(STREET_BEFORE_PATTERN_BUS_WORD_PROC_RULE2)
This rule reassigns any HIGHWAY or ROUTE attribute without a paired number to STREET-TYPE. For example, in ROUTE 66, the pattern is ROUTE ROUTE-NUM. In 209 MIDDLESEX ROUTE, ROUTE is assigned a STREET-TYPE attribute.
Force any UNITS without pairs to COMPLEX-TYPE
(STREET_BEFORE_PATTERN_BUS_WORD_PROC_RULE3)
The UNIT and UNIT-NUM attributes are usually paired as the name and number of a unit or building (for example, Building 12). This rule assigns the COMPLEX-TYPE attribute to any UNIT without a corresponding UNIT-NUM. For example:
JOHN HANCOCK BUILDING
ALPHA ALPHA UNIT
becomes
ALPHA ALPHA COMPLEX-TYPE
Activate processing of double DIRECTIONS
(STREET_BEFORE_PATTERN_BUS_WORD_PROC_RULE4)
This rule combines two adjacent DIRECTION attributes into a single DIRECTION attribute. For example:
101 NORTH EAST MAIN STREET
HSNO DIR DIR ALPHA STREET-TYPE
becomes
HSNO DIR ALPHA STREET-TYPE
Split hyphens when processing houses (except in Hawaii, New York or valid range)
(STREET_BEFORE_PATTERN_BUS_WORD_PROC_RULE5)
If a token contains a hyphenated HSNO, this rule splits the token into two separate tokens (except as noted above). For example, if the token contains 10-11, the rule splits it into 10 and 11, and assigns the HSNO attribute to each.
Combine multiple ALPHA tokens
(STREET_BEFORE_PATTERN_BUS_WORD_PROC_RULE6)
This rule instructs the CDP to combine multiple ALPHA tokens into a single ALPHA token. For example:
GREAT QUEEN ROAD
ALPHA ALPHA TYPE
becomes
ALPHA TYPE
Note:The rule STREET_LEVEL6_RULE1 performs similarly to STREET_BEFORE_PATTERN_BUS_WORD_PROC_RULE6, except that it is run only after five failed pattern look ups. Therefore, both rules must be set to ‘N’ to avoid combining adjacent ALPHA tokens into a single token.
__________________________________________________
Skip leading CARE-OF attribute
(STREET_LEVEL2_RULE1)
This rule instructs the CDP to skip a token with the CARE-OF attribute if it is the first token on the line. For example:
C/O 123 MAIN STREET
CARE-OF HSNO ALPHA STREET-TYPE
becomes
HSNO ALPHA STREET-TYPE
Skip IGNORE attribute and everything that follows it
(STREET_LEVEL2_RULE2)
The rule tells the CDP to ignore a token with an IGNORE attribute, and any tokens that follow it.
Remove pairs of two-part tokens
(STREET_LEVEL2_RULE3)
This rule removes the readily identified pairs of two-part tokens and stores the data in the pr_dwelling_name and pr_dwelling_number fields. For example:
23 Davis Rd Floor 2 Apt 35
HSNO ALPHA STREET-TYPE FLOOR FLOOR-NUM APT APT-NUM
The FLOOR and APT pairs are removed from the line and pattern and the data stored in pr_dwelling1 name and number and pr_dwelling2 name and number fields, respectively.
_________________________________________________
Force STREET-TYPE1 and SEC-STR-TYPE to STREET-TYPE
(STREET_LEVEL3_RULE1)
This rule reassigns tokens of STREET-TYPE1 (leading street type) and SEC-STR-TYPE to STREET-TYPE. For example:
123 Bowling Green St
HSNO ALPHA SEC-STR-TYPE STREET-TYPE
becomes
HSNO ALPHA STREET-TYPE STREET-TYPE
Force STREET to ALPHA
(STREET_LEVEL3_RULE2)
Certain words are assigned a STREET-NAME attribute in the CDP table. This rule reassigns any STREET-NAME attribute to an ALPHA attribute. For example:
101 Main St
HSNO STREET-NAME STREET-TYPE
becomes
HSNO ALPHA STREET-TYPE
Force NUMBER to ALPHA
(STREET_LEVEL3_RULE3)
The NUMBER attribute is a house number or street name expressed as a word. For example, in 25 Fifth Ave, FIFTH would have a NUMBER attribute. The initial pattern is HSNO NUMBER STREET-TYPE. This becomes HSNO ALPHA STREET-TYPE after the rule is applied.
Force ALPHA-1SPECIAL with hyphen (-) or quote (") to ALPHA
(STREET_LEVEL3_RULE4)
The ALPHA-1SPECIAL attribute represents characters that are all alphabetic except for one that can be any non-alphabetic and non-numeric character. This rule reassigns the ALPHA attribute to any token that contains a hyphen or quote mark and has the ALPHA-1SPECIAL attribute. For example:
Werner-vonBraun Str 25
ALPHA-1SPECIAL STREET-TYPE NUMERIC
becomes
ALPHA STREET-TYPE NUMERIC
__________________________________________________
Force S-DIRECTION to DIRECTION
(STREET_LEVEL4_RULE1)
The S-DIRECTION attribute is typically the abbreviated form (N, S, E, NE, SE, NW, SW) of a DIRECTION (North, South, East, West, and so on). This rule reassigns an S-DIRECTION attribute to a DIRECTION attribute. For example:
101 N Main St
HSNO S-DIRECTION ALPHA STR-TYPE
becomes
HSNO DIRECTION ALPHA STR-TYPE
__________________________________________________
Force non-last TYPE to ALPHA
(STREET_LEVEL5_RULE1)
This rule assigns any STREET-TYPE attribute that is not the last in the line to an ALPHA attribute. For example:
55 MAPLE STREET RD
HSNO ALPHA STREET-TYPE STREET-TYPE
becomes
HSNO ALPHA ALPHA STREET-TYPE
Force a single ALPHA preceded by HSNO to APARTMENT-NUM
(STREET_LEVEL5_RULE2)
This rule assigns the attribute APARTMENT-NUM to a token with a 1ALPHA intrinsic attribute that is preceded by an HSNO token. For example:
101 A Main St
HSNO 1ALPHA ALPHA STREET-TYPE
becomes
HSNO APARTMENT-NUM ALPHA STREET-TYPE
__________________________________________________
Combine adjacent ALPHA tokens into a single ALPHA
(STREET_LEVEL6_RULE1)
This rule combines adjacent ALPHA tokens into a single token with an ALPHA attribute. This feature greatly simplifies patterns for multi-word street or complex names. For example:
63 Martin Luther King Blvd
HSNO ALPHA ALPHA ALPHA STREET-TYPE
or
68 John Kennedy Street
HSNO ALPHA ALPHA STREET-TYPE
both become
HSNO ALPHA STREET-TYPE
where the ALPHA attribute logically encompasses all of the original ALPHA tokens.
The rule STREET_BEFORE_PATTERN_BUS_WORD_PROC_RULE6 performs similarly to STREET_LEVEL6_RULE1, except that it is run prior to pattern look up. Therefore, both rules must be set to ‘N’ to avoid combining adjacent ALPHA tokens into a single token.