Hive data warehouses - Connect_ETL - 9.13

Connect ETL Installation Guide

Product type
Software
Portfolio
Integrate
Product family
Connect
Product
Connect > Connect (ETL, Sort, AppMod, Big Data)
Version
9.13
ft:locale
en-US
Product name
Connect ETL
ft:title
Connect ETL Installation Guide
Copyright
2025
First publish date
2003
ft:lastEdition
2025-01-24
ft:lastPublication
2025-01-24T21:47:52.840000

Apache Hive is a data warehouse infrastructure built on top of Hadoop for providing data summarization, query, and analysis of large datasets stored in Hadoop's Distributed File System (HDFS) and other compatible file systems. Hive includes HiveQL, a query language useful for real-time analytics in Hadoop.

Connect for Big Data can connect to Hive data warehouses as:

  • sources when running on the ETL server/edge node or in the cluster
  • targets when running on the ETL server/edge node or in the cluster

Hive tables can also be accessed as HCatalog sources and targets.