About Soundexes - trillium_discovery - trillium_quality - 17.1

Trillium Control Center

Product type
Software
Portfolio
Verify
Product family
Trillium
Product
Trillium > Trillium Discovery
Trillium > Trillium Quality
Version
17.1
Language
English
Product name
Trillium Quality and Discovery
Title
Trillium Control Center
Topic type
Overview
Administration
Configuration
Installation
Reference
How Do I
First publish date
2008

A soundex is a coded identification of data values that Trillium has analyzed as "sounding" similar.

Certain data values in an attribute may sound like other values in the same attribute. Trillium groups data values with similar sounds and identifies them as a soundex.

Soundexes allow you to examine groups of values that might be related or the same, but because the spelling or data entry conventions are different, it may not be obvious that they represent the same person, company, city, and so on. To determine if there is a problem, you must examine these values. Soundexes help find duplicated data and misspellings, and give you the information you need to make decisions about data entry standards.

Soundex Codes

The soundex analysis maps characters (in a data value) to a short, four-character identifier, called a soundex code. You can use this code to identify which data values are associated as sound-alikes. This analysis is particularly useful for grouping short strings, such as first or last names. In certain cases, examining soundexes helps to identify data that is incorrect and flag a data entry problem. A soundex can also help to identify common record types.