Connect to Databricks File Systems (DBFS) - Connect_ETL - 9.13

Connect ETL Installation Guide

Product type
Software
Portfolio
Integrate
Product family
Connect
Product
Connect > Connect (ETL, Sort, AppMod, Big Data)
Version
9.13
Language
English
Product name
Connect ETL
Title
Connect ETL Installation Guide
Copyright
2024
First publish date
2003
Last updated
2024-11-08
Published on
2024-11-08T16:36:35.232000

Connect for Big Data supports connecting to Databricks File System (DBFS) source and target content using a remote file connection. This section describes how to create a remote DBFS connection.

Before you create the remote connection, specify parameters in a Connect execution profile file to support the Databricks deployment configuration. See Work with the Connect Execution Profile File.

Databricks DBFS connection requirements

  • Install Connect server on an Amazon Elastic Compute Cloud (EC2) instance, Azure Virtual Machine (VM), or your local machine.
  • Before you create the remote connection, specify Databricks deployment configuration parameters in a Connect execution profile. Use a global, user, and/or job-specific execution profile. This configuration supports the Databricks deployment. Without this configuration, the DBFS is unreachable. See Work with the Connect Execution Profile File.
  • Connect accesses Databricks using keys-based authentication. If no access keys are provided, Connect issues a UNIAMCRE error message aborts the job.