When you import data, Trillium determines which data rows are potentially a duplicate of other rows for an attribute in the entity. You access these discovered row duplicates through an attribute's Summary Data View.
To examine potential duplicate data rows
- In the Navigation View, click the Discover bar, then click the Entities tab.
- Expand the entity that contains the attribute whose duplicate rows you want to examine.
- Click the attribute. The Summary Data View opens for the selected attribute. (If the view does not open, click the Metadata Summaries icon on the Home tab.)
- Click the Content Summary tab and click Validity to expand it.
- Note the Value Distribution %, which indicates the distribution of distinct values. If it is not an expected percentage, there may be duplicate row values. For example, if you expect 100% distinct values (no duplicate values), and there is a Value Distribution % of 71, then there are duplicate values in the attribute.
-
Under View Details, click Distinct Values. The Values List View opens.
- Columns in List View
-
Column Name
Description
Value
Contains the duplicated value.
Frequency
Tells you how many rows are possible duplicates.
Dist%
Shows in what percentage of the attribute the duplication occurs.
- Examine the Frequency column. Any value greater than 1 indicates a duplicated value.
-
Right-click a row and select Drill down to matching rows. A filtered Data
Row List View of matching duplicate rows displays. Examine the rows to verify that
the information is or is not a duplicate.
Note: You can add a note to the attribute to describe the issue. You can also export the information, either to a file on your system or to the server where it can be submitted as an input file to a Quality data module for processing. See Saving Data in List Views.