Before you can create a project, you must create an entity that contains (or links to) your data. There are several distinct tasks involved:
- Select the data source
- Establish the schema file settings, based on the data source
- Preview the data
- Customize the data
- Set the load parameters
- Schedule the job
- Each entity you create requires that a repository administrator first do the
following:
- Create repositories (one or more) into which you will import the data
- Create loader connections, one for each type of data source to which you want to connect
- Grant you permission to use the repositories
If these tasks have not been set up properly, you will not be able to create an entity and start your work.
- During the entity creation process, you can optionally create or modify a project that uses the new entity. The following procedure only describes the steps required to create an entity.
- You can also add entities in the Discovery Center. These entities will display on the Control Center Entities tab. Similarly, entities you create in the Control Center will be available when you access the same repository through the Discovery Center. For more information, see Working with the Discovery Center.
- To create HDFS entities (data sources) and profile HDFS data in your Hadoop environment, use the Discovery Center application, installed with the Trillium repository server software. HDFS entities can be viewed in Control Center but are supported for profiling activities in Discovery Center only.
CyberArk Security Integration
Discovery Center and Control Center can be integrated with your company’s CyberArk account security solution to make retrieving and passing database credentials more secure.
To use the integration, when you add a data source (entity) that uses a data (loader) connection to a password-protected database, rather than entering the database user ID and password, supply CyberArk credentials. Then, each time the data connection accesses a password-protected ODBC database, the database credentials are retrieved from an encrypted digital vault on a centralized CyberArk server. The credentials are unique for each data source you add. Even when CyberArk is configured, you still have the option to enter your standard database user name ID and password.
To create an entity