SOUNDEX2 Routine with Modifier (S7) - trillium_discovery - trillium_quality - 17.1

Trillium Control Center

First publish date: 2008

The S7 modifier is used with the SOUNDEX2 routine to ensure compatibility with Series 7. It applies the same algorithm that Series 7 uses to the SOUNDEX2 routine. Use S7 when you want to obtain the same linking results as Series 7.

Note: All SOUNDEX-related routines work with attribute sizes of up to 256 characters. If the attribute length exceeds 256 characters, all SOUNDEX routines apply the algorithm to the first 256 characters, and all RSOUNDEX routines apply it to the last non-blank 256 characters.
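
The 256-character limit in the note can be illustrated with a minimal Python sketch. The function names and the MAX_LEN constant are hypothetical and are not part of the product. The sketch also assumes that "last non-blank 256 characters" means trailing blanks are dropped before the last 256 characters are taken.

```python
MAX_LEN = 256  # limit stated in the note above


def soundex_window(value: str) -> str:
    """Portion of the attribute a SOUNDEX routine would see: the first 256 characters."""
    return value[:MAX_LEN]


def rsoundex_window(value: str) -> str:
    """Portion an RSOUNDEX routine would see (assumption: strip trailing
    blanks first, then keep the last 256 characters)."""
    return value.rstrip(" ")[-MAX_LEN:]
```

For an attribute made of 300 A characters, 10 B characters, and 5 trailing blanks, soundex_window keeps 256 A characters, while rsoundex_window keeps 246 A characters followed by the 10 B characters.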

Algorithm

  1. Capitalize all characters in the string.
  2. Replace all non-leading vowels with A.
  3. Transform the following prefixes:
    From characters   To characters
    MAC               MCC
    KN                NN
    K                 C
    PF                FF
    SCH               SS
    PH                FF
  4. Transform these character combinations that occur after the first position:
    From characters   To characters
    DG                GG
    CAAN              TAAN
    D                 T (not iterative)
    Q                 G (not iterative)
    NST               NSS
    AV                AF
    Z                 S (not iterative)
    M                 N (not iterative)
    KN                NN
    K                 C (not iterative)
    AW                A
    PH                FF
    SCH               SSS
  5. Replace H with A unless it is preceded and followed by A (for example, AHA).
  6. Remove all A characters except for the leading A.
  7. Remove all but the first of repeating adjacent character substrings.
  8. Return the first four characters.
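
The eight steps above can be sketched in Python as follows. This is an illustrative reading of the published steps, not the product's implementation, and the function and table names are hypothetical. Several details are assumptions because the steps leave them open: vowels are A, E, I, O, and U (Y is not treated as a vowel); only the first matching prefix rule is applied; step 4 rules are applied in table order to the text after the first character; rules marked "not iterative" run as a single pass, while the others repeat until they no longer match; step 5 is applied to every H, including a leading one; and step 7 collapses any immediately repeated substring.

```python
import re

# Step 3 table: (from, to). Only the first matching prefix is applied (assumption).
PREFIX_RULES = [("MAC", "MCC"), ("KN", "NN"), ("K", "C"),
                ("PF", "FF"), ("SCH", "SS"), ("PH", "FF")]

# Step 4 table: (from, to, iterative). Rules flagged False are the ones the
# table marks "(not iterative)".
INNER_RULES = [
    ("DG", "GG", True), ("CAAN", "TAAN", True), ("D", "T", False),
    ("Q", "G", False), ("NST", "NSS", True), ("AV", "AF", True),
    ("Z", "S", False), ("M", "N", False), ("KN", "NN", True),
    ("K", "C", False), ("AW", "A", True), ("PH", "FF", True),
    ("SCH", "SSS", True),
]


def soundex2_s7(value: str) -> str:
    """Hypothetical sketch of SOUNDEX2 with the S7 modifier."""
    s = value.upper()                                   # Step 1
    if not s:
        return s
    s = s[0] + re.sub(r"[EIOU]", "A", s[1:])            # Step 2
    for src, dst in PREFIX_RULES:                       # Step 3
        if s.startswith(src):
            s = dst + s[len(src):]
            break
    head, tail = s[0], s[1:]                            # Step 4
    for src, dst, iterative in INNER_RULES:
        tail = tail.replace(src, dst)                   # single left-to-right pass
        while iterative and src in tail:                # repeat until stable
            tail = tail.replace(src, dst)
    s = head + tail
    # Step 5: replace H unless it has an A on both sides.
    s = re.sub(r"(?<!A)H|H(?!A)", "A", s)
    s = s[0] + s[1:].replace("A", "")                   # Step 6
    previous = None                                     # Step 7
    while previous != s:
        previous = s
        s = re.sub(r"(.+?)\1+", r"\1", s)
    return s[:4]                                        # Step 8
```

Under these assumptions, Knight and Night both encode to NGT, Schmidt and Smith both encode to SNT, and Macdonald and McDonald both encode to MCTN. Results from the product may differ wherever one of the assumptions above does not hold.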

Table 1. Scoring for SOUNDEX2 With Modifier S7

Score   Description
0       Case 1: Both strings are blank.
        Case 2: Neither string is blank and they are not equal.
50      One string is blank; the other string is non-blank.
100     Strings are equal.
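
The scoring rules in Table 1 can be expressed as a small Python function. This is a sketch only: the function name is hypothetical, the two arguments are assumed to be values that the routine has already encoded, and "blank" is assumed to mean empty or made up entirely of whitespace.

```python
def score_soundex2_s7(code_a: str, code_b: str) -> int:
    """Return 0, 50, or 100 following Table 1 (hypothetical helper)."""
    a_blank = not code_a.strip()
    b_blank = not code_b.strip()
    if a_blank and b_blank:
        return 0          # Case 1: both strings are blank
    if a_blank or b_blank:
        return 50         # exactly one string is blank
    # Neither string is blank: 100 when equal, otherwise 0 (Case 2)
    return 100 if code_a == code_b else 0
```

For example, score_soundex2_s7("NGT", "NGT") returns 100, score_soundex2_s7("NGT", "") returns 50, and score_soundex2_s7("NGT", "SNT") returns 0.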