pdsh and /etc/hosts
pdsh
pdsh is a command-line tool that runs a shell command on multiple nodes in parallel. It can be installed directly from the distribution's package repositories.
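As a rough sketch, the install on the master node might look like the following. The distribution is not stated here, so the package manager is an assumption: this example assumes a Debian or Ubuntu system.

```shell
# Assumption: Debian/Ubuntu master node. On RHEL-family systems the package
# is typically installed with the distribution's own package manager instead.
sudo apt-get install -y pdsh
```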
pdsh requires SSH keys or another form of passwordless authentication between nodes. To test the setup, run a simple command such as hostname on all nodes.
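A test run could look something like this. The address range is hypothetical and only stands in for the real compute node addresses; -R selects the remote command module and -w gives the target node list.

```shell
# Hypothetical compute nodes at 192.168.1.101 through 192.168.1.104.
# The node list is quoted so the local shell does not treat the brackets as a glob.
pdsh -R ssh -w '192.168.1.[101-104]' hostname
```

If authentication is set up correctly, each node should answer with its own hostname, prefixed by the address it was reached at.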
To avoid passing -R ssh on every invocation, set the PDSH_RCMD_TYPE environment variable to ssh.
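For the current shell session, setting the variable is a one-line export:

```shell
# Make ssh the default remote command module for pdsh in this shell session.
export PDSH_RCMD_TYPE=ssh
```

After this, pdsh invocations started from the same shell can drop the -R ssh option.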
To apply this for all users, create a shell script under /etc/profile.d that exports the variable.
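One possible way to create that file is sketched below. The file name pdsh.sh is an assumption chosen for illustration; any name ending in .sh under /etc/profile.d is normally picked up by login shells.

```shell
# Hypothetical file name: /etc/profile.d/pdsh.sh
# tee runs under sudo because the directory is writable only by root.
echo 'export PDSH_RCMD_TYPE=ssh' | sudo tee /etc/profile.d/pdsh.sh
```

The setting takes effect for each user at their next login.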
For pdcp, and for using pdsh from compute nodes rather than the master, pdsh must be installed on the compute nodes as well. This can be done by running pdsh on the master node to install the package from the repositories on each compute node.
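A sketch of that remote install follows. It assumes Debian or Ubuntu compute nodes, passwordless sudo for the remote user, and a hypothetical address range; adjust all three to match the actual cluster.

```shell
# Assumptions: apt-based compute nodes, passwordless sudo on each node,
# and hypothetical node addresses 192.168.1.101 through 192.168.1.104.
pdsh -R ssh -w '192.168.1.[101-104]' 'sudo apt-get install -y pdsh'
```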
/etc/hosts
Up until now, we have been accessing compute nodes by IP address, which can get tedious. To address nodes by name instead, use pdsh to add an entry for each compute node to /etc/hosts.
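As an illustration, the entries can be generated with a small helper and reviewed before any file is changed. The node names (node01 to node04), the address range, and the helper name hosts_entries are all hypothetical.

```shell
# Hypothetical layout: four compute nodes, node01..node04,
# at 192.168.1.101 through 192.168.1.104.
hosts_entries() {
  for i in 1 2 3 4; do
    # One /etc/hosts line per node: address, a tab, then the node name.
    printf '192.168.1.%d\tnode%02d\n' "$((100 + i))" "$i"
  done
}

# Print the entries for review before touching any /etc/hosts file.
hosts_entries
```

Once the output looks right, it could be appended on the master with hosts_entries | sudo tee -a /etc/hosts. Assuming passwordless sudo on the compute nodes, an individual entry could then be pushed out with a command along the lines of pdsh -w '192.168.1.[101-104]' "echo '192.168.1.101 node01' | sudo tee -a /etc/hosts", repeated for each entry.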