The final steps to discovering a key are to configure sampling options then run the analysis. When you discover keys, the analysis is run against a sample of your full data set. To configure the sample options, accept the defaults or customize the options as needed.
- Sampling example:
-
If a data source has 1000 rows and you want to analyze the last 500 rows, use these options:
Number of sample rows
|
500 |
Start sampling at row
|
500 |
Sample
|
Every row |
To configure sampling and finish discovered key analysis
-
In the Discover Keys window, click the Finish tab.
-
To configure sampling options, keep the defaults or change them as follows:
Sampling Option |
Description |
Number of sample rows:
|
Enter the total number of rows you want analyzed. The default is
the total number of rows in the data source.
Note: If you have more than 10,000 rows, the default analysis process
is performed on a maximum sample of 10,000 rows. It is not
recommended to go beyond this value. Performance will
decrease.
|
Sample:
|
Select one of the following:
-
Every row - Include all rows in the sample.
-
Every nth - Enter a number to indicate how many rows to
skip before selecting the next row to include.
-
Random - Include a random sample of rows in the analysis.
Enter a percentage of the total rows to sample.
Note: The random % is the percentage chance of rows being included.
Therefore, the actual rows loaded may be different for each load of
the same file even if you specify the same percentage.
|
Start sampling at row:
|
Enter the row position to begin the sample. The default is row 1.
|
- Optional:
Click Reset to Defaults to revert options back to the defaults.
-
Select Schedule job options:
- Select Now (the default) to schedule the job to run when you click Finish.
- Select Later to schedule the job to run at a later time. The current date and the time at the next half hour display. To change the date, click the calendar icon () and select a starting date. To change the time, click the clock icon () and select a time you want the job to run on the selected date.
-
Click Summary to review all selections and make edits as needed. Click Back or In Progress to return to the Finish tab.
-
Do one of the following:
- Click Finish. A message displays confirming the analysis has been started or scheduled.
- Click Cancel or the X icon to close the window without saving your work.
-
Do one of the following:
- Click Task Manager to open the Task Manager and monitor the status of the discovery analysis job if run now. Scheduled tasks show a status of pending.
- Click Done to close the window.
-
If the analysis discovered any keys, click Refresh to see the number of keys updated in the data source's Relationships table.