Trillium can profile data from heterogeneous sources, each of which uses its own terminology. To make it easier to compare data and relationships across sources, Trillium uses the standard terms defined below for all data sources. We recommend that you familiarize yourself with these definitions before you start to work with projects.
This table lists terms that you will encounter as you use Trillium.
Term | Description |
---|---|
attribute | Column or field of an entity, depending on the structure of the data source. |
dependency | Data relationship in which one or more attributes determine the value of another attribute. |
dynamic entity | An entity that is linked to an external data source and is not imported into a repository. See also real entity. |
entity | File or table associated with a data source. See also real entity. |
finding | Documented results of a data discovery activity. |
join | Intersection of identical or related data across two or more entities. |
key | Attribute that uniquely identifies and associates data within an entity, binding the data together. |
metadata | Statistics and properties associated with a repository object. |
project | Object that you create to reference a set of data and the data quality activities you perform. It includes information about metadata and workflow tasks. |
real entity | An entity in which the data is loaded into the repository. See also dynamic entity. |
repository | Object that contains the data and metadata on which you perform data discovery and data quality activities. |
repository server | Collection of one or more repositories. It has its own group of users, loader connections, and security and performance settings. |
row | A data record that is associated with a specific entity. |
schema | A file that describes the shape of your input or output data. |
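
To make the terms key and dependency more concrete, the following minimal sketch checks both properties on a small in-memory entity. It is an illustration only and does not use or represent any Trillium API: the sample rows, the attribute names (`order_id`, `zip`, `city`), and the helper functions `is_key` and `holds_dependency` are all hypothetical.

```python
# Hypothetical entity: each dict is a row, each dict key is an attribute.
rows = [
    {"order_id": 1, "zip": "02134", "city": "Boston"},
    {"order_id": 2, "zip": "02134", "city": "Boston"},
    {"order_id": 3, "zip": "10001", "city": "New York"},
]


def is_key(rows, attributes):
    """Return True if the attribute combination is unique across all rows."""
    seen = set()
    for row in rows:
        value = tuple(row[a] for a in attributes)
        if value in seen:
            return False  # duplicate value, so it cannot identify a row
        seen.add(value)
    return True


def holds_dependency(rows, determinants, dependent):
    """Return True if the determinant attributes fix the dependent attribute."""
    mapping = {}
    for row in rows:
        left = tuple(row[a] for a in determinants)
        # Remember the first dependent value seen for each determinant value;
        # any later mismatch breaks the dependency.
        if mapping.setdefault(left, row[dependent]) != row[dependent]:
            return False
    return True


print(is_key(rows, ["order_id"]))                # True
print(is_key(rows, ["zip"]))                     # False
print(holds_dependency(rows, ["zip"], "city"))   # True
```

In this sample, `order_id` behaves as a key because no value repeats, while `zip` does not. The attribute `zip` determines `city`, which is the kind of relationship the glossary calls a dependency.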