How Does AddrScan Processing Work? - Finalist - 10.0

Finalist Guide

Product type
Software
Portfolio
Verify
Product family
Data Quality - Postals
Product
Finalist
Version
10.0
Language
English
Product name
Finalist
Title
Finalist Guide
Copyright
2023
First publish date
1984
Last updated
2024-05-15
Published on
2024-05-15T00:57:22.811031

AddrScan processing includes the following steps:

  1. Identify the address lines. AddrScan identifies each input address line and assigns an input line type code to each line.

Table 1. Input Line Type Codes
Line Type Code Description
0 Firm line
1 PO box address line
2 Address line with both a range and a suffix word
3 Address line with a range but no suffix

Address line with a suffix but no range

4 Rural route address line
5 Personal name

Firm name (without firm words)

Unidentified

6 Apartment type line
7 Possible city line
8 City, state, ZIP Code (or ZIP + 4) line
9 Ignore this line
B Box address line following a rural route address line
C City line
M Military address line
P Urb (P on mainframe, p on Windows and *Nix)
R Rural route address line preceding a box line
S State line
W PMB line
X City and state line
Y State and ZIP Code line
Z ZIP Code line
Blank No data present on input line
  1. Concatenate related data from the six input address lines. AddrScan uses the line type codes to determine which lines can be combined to form standardized addresses. The following concatenation rules determine the address line combination.

Table 2. Concatenation Rules
Rule Description
Line type code 6 lines append to line type code 2 lines.
Line type code 6 lines append to line type code 3 lines.
Line type code 8 lines append to line type code 7 lines.
Line type code B lines append to line type code R lines.
Note: AddrScan combines lines only up to the point where the receiving line can hold all of the combined data. For example, assume there are three lines to combine, but the receiving line only has enough room for the first two lines. AddrScan will combine only the first two lines and leave the existing third line.
  1. Lines with a line type code 6, 7, or B append to the original street/city/rural line and form a new concatenated line.
  2. The new concatenated line overlays the original street/city/rural line.
  3. AddrScan "reassigns" the line type code 6, 7, and B lines a new line type code of "9".
  4. These lines are now part of the new concatenated line used to build the standardized address. In effect, the line type code "9" means "ignore this address line," and thereby prevents the lines from appearing twice in the standardized address. The following example demonstrates line type code assignment.

    An example of an address passed for concatenation follows. Note the input line type codes assigned to each line during the identification process.

    PO Box 123 ** 1 **

    123 Main Street ** 2 **

    Apt 12A ** 6 **

    Roseville MI 48066 ** 8 **

    AddrScan appends the third line (input line type code 6) to the second line (input line type code 2). After concatenation, the address lines are ready for standardization.

  1. Standardize the address lines. Standardized lines are the original lines from your input file that are corrected and returned to the user input area (replacing your original input lines). The AddrScan standardization process steps are:
    1. Return the standardized lines in any order. AddrScan places the standardized lines in the order identified by codes passed from the calling program. If you enter invalid codes, the system returns the lines in the order entered.
    2. Return standardized line type codes to identify each of the standardized lines.

Table 3. Standardized Line Type Codes
Line Type Code Description
0 Firm
1 P. O. box
2 Street (with suffix)
3 Street (without suffix)
4 Rural route
5 Personal name (probable)
8 City/state/ZIP Code
9 Miscellaneous (i.e., C/O)
M Military
P Puerto Rico URB Data
Note: Displays "P" (upper case) for the z/OS platform. For all other platforms, displays "p" (lower case).
  1. Return the processed lines.
    1. Build the standardized address in the order requested. If no order is requested, the lines are returned in the order entered.
    2. Identify the ZIP + 4 Code.
    3. Remove the ZIP + 4 Code from the returned city line and place the ZIP + 4 Code in a fixed field.

      AddrScan only identifies the ZIP + 4 Code during this process. The Finalist API verifies the ZIP + 4 Code.

    4. Build two four-byte line indexes. The line indexes help you relate the returned address to your original input record and to the standardized address. Index1 contains the line numbers in your original input record that correspond to the returned lines. Index2 contains the line numbers in the standardized address that correspond to the returned lines.

Table 4. Index Pointer Table
Position Description
1 Source of Returned Line 1
2 Source of Returned Line 2
3 Source of Puerto Rico urbanization data (Index1 only)
4 Source of Returned City Line

The ZIP Code is returned in a separate field and does not display in the returned address lines.

  1. Build the returned lines address by applying the format specified (Returned Line Options) to the standardized address to produce the returned lines address. AddrScan returns four 70-byte lines.
Table 5. Returned Line Contents
Line Number Description
Line 1 Street address data
Line 2 Street address data
Line 3 City and state
Line 4 Firm name (if supplied in the input record)
  1. Pass the four-line address back to the calling program in the requested format. For additional examples, refer to AddrScan Examples.

    Example

    The Returned Line Options are set to "56N". The following table describes this setting.

Table 6. Returned Line Options Example
Parameter Position Description
Return Line 1 5=Return first valid line above city line into Return Line 1
Return Line 2 6=Return second valid line above city line into Return Line 2
Return Line 3 N=No special options used
Return Line 4 Remains blank - no firm line present
  1. Identify the input lines that were not used during processing.

    Example

    AddrScan used lines 1, 2, 3, and 5 to process the input address. Input lines 4 and 6 were not used. The unUsedLines (ADDRPASS-UNUSED) six-byte field displays:

    46<blank><blank><blank><blank>

    Blanks are used as placeholders to fill the six-byte field.