Among the system configuration considerations are the settings and setup required on the local ETL servers.
Local ETL Server
On your local ETL server, do the following:
- Define each of the environment variable values in the configuration file.
The configuration file is run upon initiation of the job initiator script. When the configuration file is run, the environment variable values are available to the job initiator script, either runLargeFileLoader.sh or runMultiFileLoader.sh, which initiates the Connect ETL jobs, and the database load script, verticaLoad.sh, which generates and executes the vsql commands that interface directly with the Vertica initiator nodes.
- As a best practice, ensure that each of the components of the Connect ETL Vertica load example files, that is the configuration file, the job initiator scripts, and the database load script, are saved to the directory associated with the Connect ETL jobs. The Connect ETL jobs directory is defined by the DMXDataDirectory environment variable, which is listed in the configuration file.
Saving the components of the Connect ETL Vertica load example files to the directory defined by the DMXDataDirectory environment variable ensures that the scripts are in a protected directory after you modify them to meet the requirements of your installation.
To protect the LargeFileLoader.sh, MultiFileLoader.sh, runLargeFileLoader.sh, runMultiFileLoader.sh, and verticaLoad.sh scripts after they are customized for the environment, set their property attributes to Read-Only.