Configure Sampling and Finish Discovered Keys - trillium_discovery - Latest

Trillium Discovery Center

Product type
Software
Portfolio
Verify
Product family
Trillium
Product
Trillium > Trillium Discovery
Version
Latest
Language
English
Product name
Trillium Discovery
Title
Trillium Discovery Center
Copyright
2024
First publish date
2008
Last updated
2024-10-18
Published on
2024-10-18T14:55:05.094442

The final steps to discovering a key are to configure sampling options then run the analysis. When you discover keys, the analysis is run against a sample of your full data set. To configure the sample options, accept the defaults or customize the options as needed.

To configure sampling and finish discovered key analysis

  1. In the Discover Keys window, click the Finish tab.
  2. To configure sampling options, keep the defaults or change them as follows:
    Sampling Option Description

    Number of sample rows:

    Enter the total number of rows you want analyzed. The default is the total number of rows in the data source.

    Note: If you have more than 10,000 rows, the default analysis process is performed on a maximum sample of 10,000 rows. It is not recommended to go beyond this value. Performance will decrease.

    Sample:

    Select one of the following:

    • Every row - Include all rows in the sample.
    • Every nth - Enter a number to indicate how many rows to skip before selecting the next row to include.
    • Random - Include a random sample of rows in the analysis. Enter a percentage of the total rows to sample.
    Note: The random % is the percentage chance of rows being included. Therefore, the actual rows loaded may be different for each load of the same file even if you specify the same percentage.

    Start sampling at row:

    Enter the row position to begin the sample. The default is row 1.

  3. Optional: Click Reset to Defaults to revert options back to the defaults.
  4. Select Schedule job options:
    • Select Now (the default) to schedule the job to run when you click Finish.
    • Select Later to schedule the job to run at a later time. The current date and the time at the next half hour display. To change the date, click the calendar icon () and select a starting date. To change the time, click the clock icon () and select a time you want the job to run on the selected date.
  5. Click Summary to review all selections and make edits as needed. Click Back or In Progress to return to the Finish tab.
  6. Do one of the following:
    • Click Finish. A message displays confirming the analysis has been started or scheduled.
    • Click Cancel or the X icon to close the window without saving your work.
  7. Do one of the following:
    • Click Task Manager to open the Task Manager and monitor the status of the discovery analysis job if run now. Scheduled tasks show a status of pending.
    • Click Done to close the window.
  8. If the analysis discovered any keys, click Refresh to see the number of keys updated in the data source's Relationships table.