Sampling Options - trillium_discovery - trillium_quality - Latest

Inline Quality and Discovery

Product type
Software
Portfolio
Verify
Product family
Trillium
Product
Trillium > Trillium Quality
Trillium > Trillium Discovery
Version
Latest
Language
English
Product name
Trillium Quality and Discovery
Title
Inline Quality and Discovery
Copyright
2024
First publish date
2008
Last updated
2024-10-18
Published on
2024-10-18T15:10:12.949492

This table describes the sampling options for loading data into a repository.

Option Meaning
All Load an entire file.
First

Load a select number of rows from the beginning of the file.

Use to test data characteristics of large files before loading the files (as entities) into a repository. Allows you to load only a small number of rows. For example, you might want to load the first 1000 rows, in order to determine level of data quality, schema accuracy, and other relevant information.

Random

Randomly sample a percentage of rows from the file.

Use when you need to load large files but want to test the data by loading a small number of rows first. For example, you might want to load a smaller row sample in order to understand the schema design, before loading all rows.

Note: The random % is the percentage chance of rows being included. Therefore, the actual number of rows loaded may be different for each load of the same file even if you specify the same percentage.