Connect for Big Data is compatible with the following authorization schemes:
- Cloudera Sentry
- Apache Ranger
Cloudera Sentry
Connect for Big Data is certified to work with Cloudera's Sentry authorization of Hive databases, which requires the following to be enabled in the Cloudera cluster:
- HDFS Access Control Lists (ACLs)
- automatic synchronization of HDFS ACLs with Sentry privilegesNote: When using Sentry, Hive impersonation is disabled by default. To ensure access to the Work table directory, the default Hive user must have the correct permissions.
Apache Ranger
Connect for Big Data is compatible with Apache Ranger, a framework for enabling, monitoring,
and managing data security across the Hadoop platform. Ranger works with Apache Hadoop
(HDFS), Apache Hive, Apache Kafka, and YARN, among other Apache projects.
Note: Ranger is
currently designated as an Apache incubator project, and there are gaps in what it works
with in the Hadoop ecosystem, such as Apache HCatalog. Additionally, it does not work with
Amazon S3 or other cloud-based distributed filesystems.