This product is verified on the following Hadoop distributions:
- Cloudera 6.2.1, 6.3, 7.1, and 7.5
- Hortonworks 3.1
- EMR 5.30, 6.0, 6.30
To use the product, you must be familiar with configuring Hadoop in Hortonworks, Cloudera, or EMR, and with developing applications for distributed processing. For more information, refer to the Hortonworks, Cloudera, or EMR documentation.
The following additional tools must be available to use certain product features:
For Hive:
- Hive version 1.2.1 or above
- Hive in Spectrum Geocoding for Big Data is not supported on Cloudera 5.16
- Java JDK version 1.8 or above
- Hadoop version 2.6.0 or above
- Spark version 2.0 or above
- Zeppelin Notebook is not supported in Cloudera
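The minimum versions listed above can be checked programmatically before deployment. The following is a minimal sketch, not part of the product: the names `MINIMUM_VERSIONS`, `parse_version`, and `meets_minimum` are hypothetical, and the sketch assumes you already have each tool's version string (for example, from its `--version` output).

```python
import re

# Minimum versions taken from the requirements list above.
MINIMUM_VERSIONS = {
    "hive": "1.2.1",
    "java": "1.8",
    "hadoop": "2.6.0",
    "spark": "2.0",
}


def parse_version(text):
    """Extract the leading dotted number, e.g. '1.8.0_292' -> (1, 8, 0)."""
    match = re.match(r"\d+(?:\.\d+)*", text.strip())
    if not match:
        raise ValueError(f"no version number found in {text!r}")
    return tuple(int(part) for part in match.group(0).split("."))


def meets_minimum(tool, installed):
    """Return True if the installed version is at or above the minimum."""
    installed_v = parse_version(installed)
    minimum_v = parse_version(MINIMUM_VERSIONS[tool])
    # Pad with zeros so that '2.0' and '2.0.0' compare as equal.
    width = max(len(installed_v), len(minimum_v))
    installed_v += (0,) * (width - len(installed_v))
    minimum_v += (0,) * (width - len(minimum_v))
    return installed_v >= minimum_v


if __name__ == "__main__":
    # Example version strings; replace with the values from your cluster.
    for tool, version in [("hive", "3.1.0"), ("hadoop", "2.5.9")]:
        status = "OK" if meets_minimum(tool, version) else "too old"
        print(f"{tool} {version}: {status}")
```

The check covers only minimum versions. Distribution-specific restrictions, such as the unsupported Cloudera 5.16 and Zeppelin Notebook cases, still need to be verified separately.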
GGS Memory Requirements
- The amount of memory required depends on your deployment scenario. The Spectrum Global Geocoding SDK requires at least 16 GB of RAM; we recommend 32 GB.
- If you are using a large number of datasets (more than 20), review your minimum heap size setting. Consider increasing it to at least 8 GB to prevent out-of-memory errors.
- For more information, see the Developer Guide.
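The memory guidance above can be summarized as a small sizing check. This is an illustrative sketch only: the function names are hypothetical, and it assumes the minimum heap size is set through the standard JVM `-Xms` option. Where that option is configured depends on your deployment, so consult the Developer Guide for the actual setting.

```python
MIN_RAM_GB = 16           # basic requirement stated above
RECOMMENDED_RAM_GB = 32   # recommended amount stated above
LARGE_DATASET_COUNT = 20  # "large number of datasets" threshold
LARGE_MIN_HEAP_GB = 8     # suggested minimum heap above that threshold


def classify_ram(total_gb):
    """Classify a node's RAM against the stated requirements."""
    if total_gb < MIN_RAM_GB:
        return "insufficient"
    if total_gb < RECOMMENDED_RAM_GB:
        return "minimum"
    return "recommended"


def suggested_min_heap_flag(dataset_count, current_min_heap_gb):
    """Return a JVM -Xms flag, raised to 8 GB when more than 20 datasets are used."""
    heap_gb = current_min_heap_gb
    if dataset_count > LARGE_DATASET_COUNT:
        heap_gb = max(heap_gb, LARGE_MIN_HEAP_GB)
    return f"-Xms{heap_gb}g"


if __name__ == "__main__":
    print(classify_ram(24))
    print(suggested_min_heap_flag(dataset_count=25, current_min_heap_gb=4))
```

The sketch never lowers an existing heap setting; it only raises it to the 8 GB floor when the dataset count exceeds 20.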