Trillium supports simple data sampling tests during entity creation. If you require more complex sampling, you may want to pre-process the data before you create the entity.
If you pre-process the data in any way prior to creating entities, we recommend that you document all data preparation or sampling information by adding notes to the entity after you import to a repository.
For best results when importing samples of data, make sure the imported data contains a consistent sampling of data across all entities you plan to import to the repository. If the data sample is inconsistent, the resulting sample data analysis will not be representative of the data in the data source.
For example, if you import only 10,000 customer records but all 100,000 account records, the match quality between these two entities will be only 10%, because most of the account records have no corresponding customer record in the sample. However, if you import all customer records, the match quality might be 90% or higher.
In this same example, if you import the first 10,000 records to create a customer entity, make sure that you also import the accounts for those same customers. This gives you a higher match quality percentage.
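The effect of consistent sampling can be illustrated with a small pre-processing sketch that runs outside Trillium, before the data is imported. It is not a Trillium API. The field names (`customer_id`, `account_id`), the `match_rate` helper, and the toy data (100,000 customers with exactly one account each) are assumptions made for illustration only.

```python
def match_rate(accounts, customer_ids):
    """Return the fraction of account records whose customer_id
    appears in the given collection of customer IDs."""
    if not accounts:
        return 0.0
    ids = set(customer_ids)
    matched = sum(1 for acct in accounts if acct["customer_id"] in ids)
    return matched / len(accounts)


# Hypothetical source data: 100,000 customers, one account per customer.
customers = [{"customer_id": i} for i in range(100_000)]
accounts = [{"account_id": i, "customer_id": i} for i in range(100_000)]

# Sample the first 10,000 customer records.
sampled_customers = customers[:10_000]
sampled_ids = {c["customer_id"] for c in sampled_customers}

# Inconsistent sample: 10,000 customers against all 100,000 accounts.
inconsistent_rate = match_rate(accounts, sampled_ids)

# Consistent sample: keep only the accounts that belong to sampled customers.
sampled_accounts = [a for a in accounts if a["customer_id"] in sampled_ids]
consistent_rate = match_rate(sampled_accounts, sampled_ids)

print(f"Inconsistent sample: {inconsistent_rate:.0%} of accounts match")
print(f"Consistent sample:   {consistent_rate:.0%} of accounts match")
```

With this toy data, the inconsistent sample matches 10% of the accounts, while the consistent sample matches all of them. Real data rarely matches perfectly, so treat the second figure as an upper bound rather than an expected result.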