You can load data into Trillium from an Apache Hive environment using the TSS ODBC Hadoop Hive Driver.
The following table lists the Hadoop distributions supported on Windows and Linux systems. On Windows, 64-bit drivers are available. On Linux, both 32-bit and 64-bit drivers are available.
Hadoop Distribution | Distribution Version | Apache Hive Version |
---|---|---|
Apache Hadoop Hive | N/A | Hive 1.0.x, Hive 1.1.x, Hive 1.2.x, Hive 2.0.x, Hive 2.1.x, Hive 3.1.x |
Cloudera's Distribution Including Apache Hadoop (CDH) | CDH 5.3, CDH 5.4, CDH 5.5, CDH 5.6, CDH 5.7, CDH 5.8, CDH 5.9, CDH 5.10, CDH 5.11, CDH 5.12 | Hive 1.1.x |
Hortonworks Distribution for Apache Hadoop | HDP 2.3, HDP 2.4, HDP 2.5 | Hive 1.2.1 |
IBM BigInsights | BigInsights 4.0, BigInsights 4.1, BigInsights 4.2, BigInsights 4.3 | Hive 1.1.x (for 4.0) and Hive 1.2.1 (for others) |
MapR Distribution for Apache Hadoop | MapR 5.0, MapR 5.1, MapR 5.2 | Hive 2.1.x |
Pivotal HD Enterprise (PHD) | PHD 3.0 | Hive 1.1.x |
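If you need to check a cluster against the support table in a script, the table can be transcribed into a small lookup. The following is a minimal sketch, not part of the product: the short keys (`CDH`, `HDP`, `BigInsights`, `MapR`, `PHD`) and the function name `hive_versions` are illustrative choices, and the data is copied from the table above.

```python
# Support matrix transcribed from the table above.
# Outer keys: short distribution names (illustrative, not product identifiers).
# Inner keys: distribution versions; values: the Hive versions listed for them.
SUPPORTED = {
    "Apache Hadoop Hive": {
        "N/A": ["1.0.x", "1.1.x", "1.2.x", "2.0.x", "2.1.x", "3.1.x"],
    },
    "CDH": {
        v: ["1.1.x"]
        for v in ("5.3", "5.4", "5.5", "5.6", "5.7",
                  "5.8", "5.9", "5.10", "5.11", "5.12")
    },
    "HDP": {v: ["1.2.1"] for v in ("2.3", "2.4", "2.5")},
    "BigInsights": {
        "4.0": ["1.1.x"],
        "4.1": ["1.2.1"],
        "4.2": ["1.2.1"],
        "4.3": ["1.2.1"],
    },
    "MapR": {v: ["2.1.x"] for v in ("5.0", "5.1", "5.2")},
    "PHD": {"3.0": ["1.1.x"]},
}


def hive_versions(distribution, version):
    """Return the Hive versions listed for a distribution release.

    Returns an empty list when the distribution or version is not in the table.
    """
    return SUPPORTED.get(distribution, {}).get(version, [])
```

For example, `hive_versions("BigInsights", "4.0")` returns `["1.1.x"]`, and an unlisted release such as `hive_versions("HDP", "3.0")` returns an empty list.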
About TSS 17 Hadoop Hive Driver
To create the Apache Hive data source, select the TSS 17 Hadoop Hive Driver from the list of supported drivers (see Supported Data Sources for TSS ODBC) and follow the instructions in Creating DSN. Then, in the Advanced settings, change Max Varchar Size from the default of 2 GB to 32 KB. Otherwise, the data may not display correctly.
To change the Max Varchar size:
- Open the Advanced tab of the ODBC Apache Hive Wire Protocol Driver Setup window.
- In Max Varchar Size, enter 32768.
- Click Apply.
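The value 32768 is 32 KB expressed in bytes (32 × 1024). The sketch below shows that arithmetic and how the same setting might be passed in a DSN-based connection string instead of through the setup window. It rests on assumptions: `DSN` is the standard ODBC keyword, but the attribute name `MaxVarcharSize` and the DSN name `HiveDSN` are hypothetical placeholders, so confirm the actual attribute name in the driver documentation before relying on it.

```python
KB = 1024

# 32 KB expressed in bytes; this is the value entered in Max Varchar Size.
MAX_VARCHAR_BYTES = 32 * KB


def build_connection_string(dsn, max_varchar=MAX_VARCHAR_BYTES):
    """Build a DSN-based ODBC connection string with a Max Varchar override.

    "DSN" is the standard ODBC keyword. "MaxVarcharSize" is an ASSUMED
    attribute name used for illustration; check the driver documentation
    for the real one.
    """
    return f"DSN={dsn};MaxVarcharSize={max_varchar}"


if __name__ == "__main__":
    # "HiveDSN" is a hypothetical data source name.
    print(build_connection_string("HiveDSN"))
```

The resulting string could then be handed to any ODBC client that accepts connection strings. If the override is not recognized, setting the value in the setup window as described in the steps above remains the documented route.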