The Parser Repository (PREPOS) contains all the parsed data attributes, including error codes, identification indicators, name information, street information and geographic information. Because of the size of the PREPOS, only selected attributes are written to the output file by default. You can select and add any PREPOS attributes to your output by using Add Parser Outputs in the Schema Editor.
Guidelines
- Attribute lengths are provided in bytes due to Unicode support. You can obtain the total number of characters by dividing the attribute length by two. When the lengths are 200, the total of 99 characters (not 100 characters) are displayed except the pr_busname_original and pr_busname_recoded fields for which the total of 100 characters are displayed.
- If you are processing UTF-8 data (variable-length character encoding), you must add the xx_gin fields (xx = country code) to the PREPOS DDL.
The following table lists the PREPOS attributes.
Attribute |
Length |
Description |
---|---|---|
pr_completion_code |
2 |
Receives the parser return codes. |
pr_filler_01 |
8 |
Reserved for future use. |
pr_orig_line_pattern |
20 |
Stores the original line pattern of ten (10) input address lines corresponding to the in_area definitions. The line pattern is determined through a combination of weighting the orientation of the line within the full address and the number and weight of the elements identified on the line. Values are:
|
pr_line_rules |
40 |
Contains the two-byte line type identification rule. Values are:
|
pr_name_review_codes |
3000 |
Stores the review code for names. |
pr_street_review_codes |
300 |
Stores the review code for streets. |
pr_geog_review_codes |
300 |
Stores the review code for geography attributes. |
pr_misc_review_codes |
3000 |
Stores the review code for miscellaneous problems. |
pr_global_review_codes |
300 |
Stores the review codes for global review problems. |
pr_number_of_input_names |
4 |
Contains the number of names (personal and business) on the input record. Value is a two-digit number from 00 to 10. |
pr_number_of_output_names |
4 |
Contains the number of names (personal and business) on the output record. |
pr_name_types |
2 |
Contains a categorization value based on the name types identified on the input record. Values are:
|
pr_category |
100 |
Stores the standard or user-defined category codes assigned to word definitions in the Word and Pattern tables. Note that concatenated codes from the overall address are terminated with a character. |
pr_rev_group |
3 |
Provides review group codes that guide you to areas for parser tuning. |
pr_house_number_recoded |
30 |
Stores the input house value: for example, 5 Main Street. |
pr_house_number_original |
30 |
Stores the original input house value. |
pr_house_mask |
30 |
Stores the shape of the input house values:
|
pr_house_type |
2 |
Stores the input house type. Values are:
|
pr_street_name_recoded |
200 |
Stores input street title: for example, Main, Elm, and so forth. |
pr_street_name_original |
200 |
Stores the original input street title. |
pr_street_type1_recoded |
30 |
Stores the first street type input: for example, Rd. or St. |
pr_street_type1_original |
30 |
Stores the original first street type input. |
pr_street_type2_recoded |
30 |
Stores second street type input. |
pr_street_type2_original |
30 |
Stores original second street type input. |
pr_street_primary_dir_recoded |
24 |
Stores the input prefix street direction, such as N, S, W, E, and so forth. This attribute is populated depending on the position of the incoming direction. The pattern structure determines the final answer. Values are:
|
pr_street_primary_dir_original |
24 |
Stores the original prefix street direction input. |
pr_street_secondary_dir_recoded |
24 |
Stores the post street direction input. This attribute will or will not be populated depending on the position of the incoming direction. The pattern structure determines the final answer. Values are:
|
pr_street_secondary_dir_original |
24 |
Stores the original post street direction input. |
pr_route_name_recoded |
40 |
Stores the input route name: for example, Route, Rt. and so forth. |
pr_route_name_original |
40 |
Stores the input route name. |
pr_route_number |
16 |
Stores the input route value. |
pr_route_mask |
16 |
Stores the shape of the input route value. |
pr_route_number_type |
2 |
Stores input route type. Values are:
|
pr_box1_name_recoded |
40 |
Stores the input box name. |
pr_box1_name_original |
40 |
Stores the original input box name. |
pr_box1_number |
20 |
Stores the input box value: for example, 6, 12, and so forth. |
pr_box1_mask |
20 |
Stores the shape of the input box value. Values are:
|
pr_box1_type |
2 |
Stores the input box type. Values are:
|
pr_box2_name_recoded |
40 |
Stores the input box name. |
pr_box2_name_original |
40 |
Stores the original input box name. |
pr_box2_number |
20 |
Stores the input box value: for example, 6, 12, and so forth. |
pr_box2_mask |
20 |
Stores the shape of the input box value. Values are:
|
pr_box2_type |
2 |
Stores the input box type. Values are:
|
pr_complex1_name_recoded |
200 |
Stores the standardized name of an apartment, military, or business complex: for example, stores "Marvin" in "Marvin Gardens". |
pr_complex1_name_original |
200 |
Stores the original input name of an apartment, military, or business complex. |
pr_complex1_type_recoded |
30 60 (United Kingdom) |
Stores the standardized type of apartment, military, or business complex: for example, stores "Gardens" in "Marvin Gardens". |
pr_complex1_type_original |
30 60 (United Kingdom) |
Stores the original type of apartment, military, or business complex. |
pr_complex2_name_recoded |
200 |
Stores the standardized name of a secondary apartment, military, or business complex. |
pr_complex2_name_original |
200 |
Stores the original input name of a secondary apartment, military, or business complex. |
pr_complex2_type_recoded |
30 60 (United Kingdom) |
Stores the standardized type of a secondary apartment, military, or business complex. |
pr_complex2_type_original |
30 60 (United Kingdom) |
Stores the original type of a secondary apartment, military, or business complex. |
pr_complex3_name_recoded |
200 |
Stores the standardized name of a tertiary apartment, military, or business complex. |
pr_complex3_name_original |
200 |
Stores the original input name of a tertiary apartment, military, or business complex. |
pr_complex3_type_recoded |
30 60 (United Kingdom) |
Stores the standardized type of a tertiary apartment, military, or business complex. |
pr_complex3_type_original |
30 60 (United Kingdom) |
Stores the original type of a tertiary apartment complex, military complex, or business complex. |
pr_dwelling1_name_recoded |
200 |
Stores the standardized type of dwelling when pairs of two-part dwelling tokens exist. This attribute is filled left-to-right, top-to-bottom. |
pr_dwelling1_name_original |
200 |
Stores original dwelling type when pairs of two-part dwelling tokens exist. |
pr_dwelling1_number |
20 |
Stores the dwelling value when pairs of two-part dwelling tokens exist. |
pr_dwelling1_mask |
20 |
Stores the shape of the dwelling value when pairs of two-part dwelling tokens exist. Values are:
|
pr_dwelling1_type |
2 |
Stores the dwelling type value when pairs of two-part dwelling tokens exist. Values are:
|
pr_dwelling2_name_recoded |
200 |
Stores standardized type of the secondary dwelling when pairs of two-part dwelling tokens exist. Attribute is filled left-to-right, top-to-bottom. |
pr_dwelling2_name_original |
200 |
Stores the original type of secondary dwelling when pairs of two-part dwelling token exist. No recodes are applied. |
pr_dwelling2_number |
20 |
Stores the secondary dwelling value when pairs of two-part dwelling tokens exist. |
pr_dwelling2_mask |
20 |
Stores the shape of the secondary dwelling value when pairs of two-part dwelling tokens exist. |
pr_dwelling2_type |
2 |
Stores the dwelling type value when pairs of two-part dwelling tokens exist. Values are:
|
pr_dwelling3_name_recoded |
200 |
Stores standardized type of the tertiary dwelling when pairs of two-part dwelling tokens exist. Attribute is filled left-to-right, top-to-bottom. |
pr_dwelling3_name_original |
200 |
Stores the original type of tertiary dwelling when pairs of two-part dwelling tokens exist. |
pr_dwelling3_number |
20 |
Stores the tertiary dwelling value when pairs of two-part dwelling token exist. |
pr_dwelling3_mask |
20 |
Stores the shape of the tertiary dwelling value when pairs of two-part dwelling token exist. |
pr_dwelling3_type |
2 |
Stores the tertiary dwelling type value when pairs of two-part dwelling token exist. Values are:
|
pr_misc_address |
2000 |
Stores the miscellaneous address values. |
pr_best_street_number |
20 |
Best number composite: - If house # exists, use house number. - If APT # exists, use APT number. - If box # exists, use box number. - If route # exists, use route number.
Number is used primarily in the relationship linking process. |
pr_best_street_name |
200 |
Best street title composite: - If street name exists, use it. - If complex name/type exists, use it. - If box name exists, use it.
Number is used primarily in the relationship linking process. |
pr_country_name_recoded |
200 |
Stores the country name. Populated when a country attribute is found within address lines (not assigned by the program). |
pr_country_name_original |
100 |
Stores the original country name. |
pr_neigh1_name_recoded |
200 |
Stores the neighborhood 1/locality name. |
pr_neigh1_name_original |
200 |
Stores original neighborhood 1/locality name. No recodes are applied.
Examples: Bo Barcelona, URB El Duque
This attribute is typically used in data from the UK and Latin America. |
pr_neigh2_name_recoded |
200 |
Stores the neighborhood 2/locality name. |
pr_neigh2_name_original |
200 |
Store the original neighborhood 2/locality name. |
pr_city_name_recoded |
200 |
Stores the city name. |
pr_city_name_original |
200 |
Stores the original city name. |
pr_city_number |
12 |
Stores the city number. This value is assigned in the parser and is generally used as a primary look-up key in any postal matching process.
Examples : V22663, (ANNNNNN) |
pr_city_status |
2 |
Stores the city status numeric that helps to determine the correct city name. Values are:
|
pr_city_lname_dir |
100 |
Stores the long city name from the directory and displays the city and state abbreviations. |
pr_state_recoded |
60 |
Stores the state, province, or county name. |
pr_state_original |
60 |
Stores the original state, province, or county name. |
pr_state_number |
4 |
Stores the state number. |
pr_postal_code |
30 |
Stores the input postal code.
Examples: 01821, 01879, M8X 2X3 |
pr_postal_code_mask |
30 |
Stores the shape of the input postal code (pr_postal_code) mask. |
pr_postal_code_type |
2 |
Stores the type of input postal code. |
pr_postal_code_dir |
30 |
Stores the postal code found on the |
pr_postal_code_mask_dir |
30 |
Receives the shape of the postal code identified by the CDP (pr_postal_code_dir) mask. |
pr_postal_code_type_dir |
2 |
Contains codes assigned by the USPS, found in the 0 - Non-US record 4 - Box Code 5 - Special Military 8 - Non Unique Other - Non-US record |
pr_world_origin |
2 |
Contains the country that the record is from. 1=United States (US) 2=Canada (CA) 3=United Kingdom (UK) 4=Other 5=Brazil (BZ) 6=Australia (AU) 7=Germany (DE) 8=Italy (IT) |
pr_postal_code_index_0x (x=1-6) |
12 |
Contains US postal finance code (and other future foreign postal info). |
pr_geocd_x |
20 |
Contains US postal finance code (and other future foreign postal info) where |
pr_filler_02 |
168 |
INPUT GEOGRAPHIC MATCH SECTION—See country-specific postal matcher documentation for other country formats. OUTPUT GEOGRAPHIC MATCH SECTION—See country-specific postal matcher documentation for other country formats. |
pr_name_sect_01 |
2 |
This and all name attributes repeat nine (9) times for a total of ten (10) names. Each section is described by one attribute (for example, pr_name_sect_01, pr_name_sect_02, and so on) and redefines the entire 1982 byte area. Note: If any attribute is removed from the segment then the length of 1982
will be changed to reflect the new length.
|
pr_name_number_01 |
4 |
Stores the number representing the occurrence of this name within the record (01-10). If generating records, this contains 01 for first name encountered, 02 for 2nd name, and so forth. |
pr_name_form_01 |
2 |
A one-byte attribute in each of the ten (10) name segments in the PREPOS indicating whether the name is in consumer or business form. Values are:
If the address has no name line recognized, this attribute is not populated.
An ignore line becomes a miscellaneous line type. Note that this should be the only occasion when the name form attribute is not populated. |
pr_name_prefix_recoded_01 |
30 |
Stores the personal name prefix, using a recoded word: for example, titles such as Mr., Ms., and Dr. |
pr_name_prefix_original_01 |
30 |
Stores the original personal name prefix display: for example, titles such as Mr., Ms., and Dr. No recodes are applied. |
pr_given_name1_recoded_01 |
100 |
Stores the personal name, given name 1, using a recoded word: for example, John, Joseph, or Jillian. |
pr_given_name1_original_01 |
100 |
Stores the original personal name, given name 1 display: for example, John, Joe, or Jill. No recodes are applied. |
pr_given_name2_recoded_01 |
100 |
Stores the personal name, given name 2, using a recoded word. |
pr_given_name2_original_01 |
100 |
Stores the original personal name, given name 2 display. No recodes are applied. |
pr_given_name3_recoded_01 |
100 |
Stores the personal name, given name 3, using a recoded word. |
pr_given_name3_original_01 |
100 |
Stores the original personal name, given name 3 display. No recodes are applied. |
pr_given_name4_recoded_01 |
100 |
Stores the personal name, given name 4, using a recoded word. |
pr_given_name4_original_01 |
100 |
Stores the original personal name, given name 4 display. No recodes are applied. |
pr_surname1_recoded_01 |
100 |
Stores a personal name, surname display, using a recoded word. |
pr_surname1_original_01 |
100 |
Stores the original personal name, surname display. No recodes are applied. |
pr_surname2_recoded_01 |
100 |
Stores a personal name, surname display, using a recoded word. |
pr_surname2_original_01 |
100 |
Stores the original personal name, surname display. No recodes are applied. |
pr_name_generation_recoded_01 |
20 |
Stores the personal name, generation display, using a recoded word: for example, Jr., Sr., and so forth. |
pr_name_generation_original_01 |
20 |
Stores the original personal name, generation display. No recodes are applied. |
pr_name_suffix_recoded_01 |
30 |
Personal name, suffix display, using a recoded word: for example, titles after a personal name, such as DMD, ORTH, and so forth. |
pr_name_suffix_original_01 |
30 |
Stores the original personal name, suffix display: for example, titles after a personal name, such as DMD, ORTH, and so forth. No recodes are applied. |
pr_name_gender_01 |
2 |
Contains a personal name, gender, using a recoded word. Values are:
|
pr_busname_recoded_01 |
200 |
Stores the business name, using a recoded word. |
pr_busname_original_01 |
200 |
Stores the original business name display: for example, International Business Machines. No recodes are applied. |
pr_name_connector_recoded_01 |
30 |
Stores the Connector, last name display, using recoded word. |
pr_name_connector_original_01 |
30 |
Stores the original Connector display. No recodes are applied. |
pr_name_relation_recoded_01 |
50 |
Contains the parser-identified standardized relationship: for example, Trustee for, Executor for, or In Trust for. |
pr_name_relation_original_01 |
50 |
Contains the parser-identified input relationship. |
pr_name_original_line_number_01 |
4 |
Original line number (01-10). |
pr_name_category_01 |
50 |
Concatenated codes from category. |
pr_line_xx |
200 |
Contains the address line where |
pr_pattern |
600 |
Stores ten (10) three-character token identifiers for each name and address line. See Numerical List of Attributes.
Example: 054060054051053 |
pr_line_type |
20 |
Contains the line types of the original input address lines. Values are:
|
pr_user_field_xx |
200 |
Used to be a function of a parm offset where xx indicates a number between 00 and 10. |
xx_postal_match_input_area |
600 |
Contains the attributes specific to each postal matcher, where xx indicates the country code. Values include the list of all input attributes. |
pr_bustype_recoded_01 | 40 | Stores business type / descriptive. Repeated 9 times. |
pr_bustype_original_01 | 40 | Stores the original business type / descriptive. Repeated 9 times. |
pr_street_name_recoded_2 | 200 | Stores the input street title for the secondary street. |
pr_street_name_original_2 | 200 | Stores the original input street title for the secondary street. |
pr_street_type1_recoded_2 | 30 | Stores the first street type for the secondary street. |
pr_street_type1_original_2 | 30 | Stores the first original street type for the secondary street. |
pr_street_type2_recoded_2 | 30 | Stores the second street type for the secondary street. |
pr_street_type2_original_2 | 30 | Stores the second original street type for the secondary street. |
pr_street_primary_dir_recoded_2 | 24 |
Stores the input prefix street direction, such as N, S, W, E, and so forth for the secondary street. |
pr_street_primary_dir_original_2 | 24 | Stores the original prefix street direction for the secondary street. |
pr_street_secondary_dir_recoded_2 | 24 |
Stores the post street direction for the secondary street. |
pr_street_secondary_dir_original_2 | 24 | Stores the original post street direction for the secondary street. |
pr_misc_address_01 | 2000 | Stores the miscellaneous address values. Repeated 9 times. |
pr_sub_city | 200 |
Stores the sub-city name. Czech only. Note: For Hong Kong, the Additional geography lookup option returns island
for some cities in the pr_sub_city attribute.
|
pr_part_of_city | 200 | Stores the part-of-city element. Czech only. |
pr_changed_flags | 230 (Redefine) | Stores the change flag. See Customer Parser Change Flag for details. |
pr_verified_geography | 1 |
Stores the flag for the Additional geography lookup option. For Hong Kong, this option also returns island for some cities in the pr_sub_city attribute. Values are: Y—province/city/postcode combination validated. N—province/city/postcode combination not validated. Note: The pr_verified_geography attribute is not available for the following
countries: Basic Countries (ZZ template), Canada, China, Japan, Korea,
Netherlands, Portugal, Singapore, Taiwan , and the United Kingdom.
|