Connect for Big Data can read source content from, and write target content to, the Databricks File System (DBFS) through a remote file connection. This section describes how to create a remote DBFS connection.
Before you create the remote connection, specify parameters in a Connect execution profile file to support the Databricks deployment configuration. See Work with the Connect Execution Profile File.
Databricks DBFS connection requirements
- Install Connect server on an Amazon Elastic Compute Cloud (EC2) instance, Azure Virtual Machine (VM), or your local machine.
- Before you create the remote connection, specify the Databricks deployment configuration parameters in a Connect execution profile. You can use a global, user, or job-specific execution profile, or a combination of them. Without this configuration, DBFS is unreachable. See Work with the Connect Execution Profile File.
- Connect accesses Databricks using key-based authentication. If no access keys are provided, Connect issues a UNIAMCRE error message and aborts the job.
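Because a job aborts when access keys are missing, it can help to check for them before submitting the job. The following is a minimal sketch of such a pre-flight check, not part of Connect itself. The names `EXAMPLE_ACCESS_KEY_ID` and `EXAMPLE_SECRET_ACCESS_KEY` and the function `missing_keys` are hypothetical placeholders; substitute the key names that your deployment actually uses.

```python
from typing import Iterable, List, Mapping

# Hypothetical placeholder names, not Connect parameter names.
# Replace them with the access key settings your deployment uses.
REQUIRED_KEYS = ("EXAMPLE_ACCESS_KEY_ID", "EXAMPLE_SECRET_ACCESS_KEY")


def missing_keys(
    env: Mapping[str, str],
    required: Iterable[str] = REQUIRED_KEYS,
) -> List[str]:
    """Return the names in `required` that are absent or blank in `env`."""
    return [name for name in required if not env.get(name, "").strip()]
```

For example, calling `missing_keys(os.environ)` in a wrapper script and stopping when the returned list is not empty reports the problem before the job is submitted, rather than after it has started.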