Setting the download permissions allows multiple services to download and update the downloaded data when required. You should have a common operating system group of which all the service users who need to download the data are part of. For example, if Hive and YARN jobs are required to download data and use the same download location, then both the Hive and YARN operating system users should be part of a common operating system group. The group of the download directory should be the common operating system group, one that has Read, Write, and Execute (775) permissions for the owner and group.
Your group should contain services and users that will run jobs in your cluster. You may skip services you will not use or do not have installed. Services include YARN, Hive, Zeppelin, and Hue.
You also should include all operating system users who will run jobs such as sdkuser and <myOtherUser>.
- Add the group.
sudo groupadd dm_users
- Add users to the group.
sudo usermod -a -G dm_users hive sudo usermod -a -G dm_users yarn sudo usermod -a -G dm_users zeppelin sudo usermod -a -G dm_users hue sudo usermod -a -G dm_users sdkuser sudo usermod -a -G dm_users <myOtherUser>
- Using a window where no job is running, restart all the services whose operating system users were added to the new group.
- Using a window where no job is running, restart the session of all the operating system users that were added to new group (for example, sdkuser).
- Update the group to the common operating system group and update permissions to 775 for
the download directory specified in the
pb.download.location
property.sudo chgrp dm_users /precisely/downloads sudo chmod 775 /precisely/downloads