Apache Hadoop Hive Distribution Support

Trillium DQ Repository Administrator Guide

Product type: Software
Portfolio: Verify
Product family: Trillium
Product: Trillium > Trillium Discovery; Trillium > Trillium Quality
Version: 17.1
Language: English
Product name: Trillium Quality and Discovery
Topic type: Administration, Overview, How Do I, Configuration, Reference, Installation
First publish date: 2008

You can load data into Trillium from an Apache Hive environment using the TSS ODBC Hadoop Hive Driver.

Note: You can only read data from an Apache Hive environment. You cannot write to it.

The following table lists the supported Hadoop distributions for Windows and Linux systems. For Windows, only 64-bit drivers are available. For Linux, both 32-bit and 64-bit drivers are available.

Table 1. Apache Hadoop Hive Distribution Support for Linux and Windows

| Hadoop Distribution | Distribution Version | Apache Hive Version |
|---|---|---|
| Apache Hadoop Hive | N/A | Hive 1.0.x, Hive 1.1.x, Hive 1.2.x, Hive 2.0.x, Hive 2.1.x, Hive 3.1.x |
| Cloudera's Distribution Including Apache Hadoop (CDH) | CDH 5.3, CDH 5.4, CDH 5.5, CDH 5.6, CDH 5.7, CDH 5.8, CDH 5.9, CDH 5.10, CDH 5.11, CDH 5.12 | Hive 1.1.x |
| Hortonworks Distribution for Apache Hadoop | HDP 2.3, HDP 2.4, HDP 2.5 | Hive 1.2.1 |
| IBM BigInsights | BigInsights 4.0, BigInsights 4.1, BigInsights 4.2, BigInsights 4.3 | Hive 1.1.x (for 4.0) and Hive 1.2.1 for others |
| MapR Distribution for Apache Hadoop | MapR 5.0, MapR 5.1, MapR 5.2 | Hive 2.1.x |
| Pivotal HD Enterprise (PHD) | PHD 3.0 | Hive 1.1.x |
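When you script environment checks before setting up a data source, the support table and the driver availability described above can be encoded as a simple lookup. The following is a minimal sketch, not part of Trillium: the names `SUPPORT`, `DRIVER_BITS`, `hive_versions`, and `driver_available` are hypothetical, and the short distribution keys (`CDH`, `HDP`, and so on) are abbreviations chosen for this example. The values are transcribed from Table 1 and should be kept in step with it.

```python
# Support matrix transcribed from Table 1 (hypothetical helper, not a Trillium API).
# Keys are short distribution names chosen for this sketch.
SUPPORT = {
    "Apache": {"N/A": ["1.0.x", "1.1.x", "1.2.x", "2.0.x", "2.1.x", "3.1.x"]},
    "CDH": {
        v: ["1.1.x"]
        for v in ["5.3", "5.4", "5.5", "5.6", "5.7", "5.8",
                  "5.9", "5.10", "5.11", "5.12"]
    },
    "HDP": {v: ["1.2.1"] for v in ["2.3", "2.4", "2.5"]},
    "BigInsights": {
        "4.0": ["1.1.x"],
        "4.1": ["1.2.1"],
        "4.2": ["1.2.1"],
        "4.3": ["1.2.1"],
    },
    "MapR": {v: ["2.1.x"] for v in ["5.0", "5.1", "5.2"]},
    "PHD": {"3.0": ["1.1.x"]},
}

# Driver bitness per operating system, as stated in the text above the table.
DRIVER_BITS = {"Windows": (64,), "Linux": (32, 64)}


def hive_versions(distribution, version="N/A"):
    """Return the Hive versions listed for a distribution release.

    An empty list means the combination does not appear in Table 1.
    """
    return SUPPORT.get(distribution, {}).get(version, [])


def driver_available(os_name, bits):
    """Return True if a driver of the given bitness is listed for the OS."""
    return bits in DRIVER_BITS.get(os_name, ())


if __name__ == "__main__":
    print(hive_versions("CDH", "5.12"))
    print(driver_available("Windows", 32))
```

An empty result from `hive_versions` only means that the combination is not listed in the table; it does not prove that the driver would fail against that release.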

About TSS 17 Hadoop Hive Driver

Create the Apache Hive data source by selecting the TSS 17 Hadoop Hive Driver from the list of supported drivers (see Supported Data Sources for TSS ODBC) and following the instructions in Creating DSN. Then, in the Advanced settings, change the Max Varchar Size field from its default of 2 GB to 32 KB (32768). Otherwise, the data may not display correctly.

To change the Max Varchar Size

  1. Open the Advanced tab of the ODBC Apache Hive Wire Protocol Driver Setup window.

     Figure: Configuring TSS 17 Hadoop Hive Driver
  2. In Max Varchar Size, enter 32768.

  3. Click Apply.
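
On systems where the DSN is stored in an INI-style ODBC configuration file, the setting can also be checked from a script. The following is a minimal sketch under stated assumptions: the DSN name `HiveDSN`, the driver path, and the key name `MaxVarcharSize` are hypothetical placeholders, so confirm the actual attribute name in your driver's configuration before relying on it.

```python
import configparser

# 32 KB, the value entered in Max Varchar Size in the steps above.
EXPECTED_MAX_VARCHAR = 32 * 1024  # 32768


def max_varchar_ok(ini_text, dsn, key="MaxVarcharSize"):
    """Return True if the DSN section sets the key to 32768.

    The key name is an assumption made for this sketch; pass the real
    attribute name used by your driver if it differs.
    """
    parser = configparser.ConfigParser(interpolation=None)
    parser.optionxform = str  # keep key names case-sensitive
    parser.read_string(ini_text)
    if not parser.has_section(dsn):
        return False
    value = parser.get(dsn, key, fallback=None)
    if value is None or not value.strip().isdigit():
        return False
    return int(value) == EXPECTED_MAX_VARCHAR


# Hypothetical DSN entry used only to exercise the check.
SAMPLE = """\
[HiveDSN]
Driver=/path/to/driver.so
MaxVarcharSize=32768
"""

if __name__ == "__main__":
    print(max_varchar_ok(SAMPLE, "HiveDSN"))
```

The check returns False for a missing DSN, a missing key, or any value other than 32768, which makes it easy to flag a data source that was left at the 2 GB default.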