Distributing Reference Data Using HDFS

Spectrum Routing Installation: Hortonworks

Product type: Software
Portfolio: Locate
Product family: Spectrum
Product: Spatial Big Data > Routing for Big Data
Version: 5.1
Language: English
Product name: Spectrum Routing for Big Data
Copyright: 2024
First publish date: 2017
Last updated: 2024-10-18
Published on: 2024-10-18T09:54:47.614515

Now that the SDK is installed and the routing reference data is configured, the reference data must be distributed across the cluster.

For the purposes of this guide, we will:

  • continue using sdkuser
  • install the reference data into hdfs:///precisely/routing/data

  1. Create an installation directory on HDFS and give ownership to sdkuser.
    sudo -u hdfs hadoop fs -mkdir hdfs:///precisely
    sudo -u hdfs hadoop fs -chown sdkuser:sdkuser hdfs:///precisely
  2. Upload the reference data into HDFS.
    hadoop fs -mkdir hdfs:///precisely/routing
    hadoop fs -copyFromLocal /precisely/routing/data hdfs:///precisely/routing/data
  3. When a data node performs routing tasks, it downloads the reference data from HDFS onto its local file system, so a local download directory must exist on every data node. Run the following commands on all data nodes and HiveServer nodes.
    sudo mkdir /precisely/downloads
    sudo chown sdkuser:hadoop /precisely/downloads
    sudo chmod 775 /precisely/downloads
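
Once the steps above are complete, it can be worth confirming that the upload and the local directory look as expected. The following is a minimal sketch, not part of the SDK: the function names and the HADOOP, HDFS_DATA and LOCAL_DOWNLOADS variables are hypothetical, and it assumes the hadoop command is on the PATH and that the paths match the ones used in this guide.

```shell
#!/usr/bin/env bash
# Sketch only: sanity checks for the steps above. The function names and the
# HADOOP, HDFS_DATA and LOCAL_DOWNLOADS variables are illustrative assumptions.

HADOOP="${HADOOP:-hadoop}"                                  # assumes the hadoop CLI is on PATH
HDFS_DATA="${HDFS_DATA:-hdfs:///precisely/routing/data}"    # upload target from step 2
LOCAL_DOWNLOADS="${LOCAL_DOWNLOADS:-/precisely/downloads}"  # local directory from step 3

# Succeeds if the reference data directory exists in HDFS.
check_hdfs_data() {
  # "hadoop fs -test -d" exits 0 when the path exists and is a directory
  if "$HADOOP" fs -test -d "$HDFS_DATA"; then
    echo "OK: $HDFS_DATA exists"
  else
    echo "MISSING: $HDFS_DATA"
    return 1
  fi
}

# Succeeds if the local download directory exists and is writable by the current user.
check_local_dir() {
  if [ -d "$LOCAL_DOWNLOADS" ] && [ -w "$LOCAL_DOWNLOADS" ]; then
    echo "OK: $LOCAL_DOWNLOADS is writable"
  else
    echo "NOT READY: $LOCAL_DOWNLOADS"
    return 1
  fi
}
```

After sourcing the script, check_hdfs_data can be run from any node with an HDFS client, and check_local_dir can be run as sdkuser (or another member of the hadoop group) on each data node and HiveServer node. Running hadoop fs -ls hdfs:///precisely/routing/data lists what was actually uploaded.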