Re-provisioning the cluster

If an existing Omnia cluster needs a different OS version or a fresh installation, the cluster can be re-provisioned. During re-provisioning, users can modify the following:

  • The operating system

  • CUDA

  • OFED

To re-provision the cluster, run the following commands:

cd provision
ansible-playbook provision.yml -i inventory

Alternatively, if a re-deployment with no modifications is required, run:

cd provision
ansible-playbook discovery_provision.yml -i inventory

Here, the inventory contains a list of host IPs (sourced from the nodeinfo table), as shown below:

10.5.0.101
10.5.0.102

Note

  • The host IPs passed in the inventory should be the ones assigned by Omnia. They are not changed during re-provisioning.

  • If the nodes were discovered via snmpwalk or mapping, users will be required to manually reboot target nodes.

  • Do not include groups like manager, compute and login in the passed inventory.
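The inventory rules in the note above can be checked before running the playbook. The following is a minimal sketch and not part of Omnia: the function name check_flat_inventory is hypothetical, and it assumes that the inventory holds one bare IP address per line and that lines starting with # are comments.

```python
import ipaddress


def check_flat_inventory(text):
    """Return a list of problems found in a re-provisioning inventory.

    Hypothetical helper, not part of Omnia. It flags group headers such as
    [manager] and any line that is not a bare IP address.
    """
    problems = []
    for lineno, raw in enumerate(text.splitlines(), start=1):
        line = raw.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments (assumed convention)
        if line.startswith("[") and line.endswith("]"):
            problems.append(f"line {lineno}: group header {line} is not allowed")
            continue
        try:
            ipaddress.ip_address(line)
        except ValueError:
            problems.append(f"line {lineno}: {line!r} is not a bare IP address")
    return problems


flat = "10.5.0.101\n10.5.0.102\n"
grouped = "[manager]\n10.5.0.101\n"

print(check_flat_inventory(flat))     # no problems expected
print(check_flat_inventory(grouped))  # reports the [manager] header
```

An empty list means the file contains only IP addresses. The check cannot tell whether those addresses were assigned by Omnia, so that still needs to be confirmed against the nodeinfo table.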

Setting up the cluster

  1. Insert the new IPs (only if a new node is to be added) and/or move nodes between groups in the existing inventory file, as shown in the example below.

Existing inventory

[manager]
10.5.0.101

[compute]
10.5.0.102
10.5.0.103

[login]
10.5.0.104

Updated inventory with the new node information

[manager]
10.5.0.102

[compute]
10.5.0.101
10.5.0.103
10.5.0.104
10.5.0.106

[login]
10.5.0.105

In the above example, the node 10.5.0.102 has been moved from compute to manager, the nodes 10.5.0.101 and 10.5.0.104 have been moved to compute, a new login node 10.5.0.105 has been set up, and the node 10.5.0.106 has been added to the cluster as a compute node.
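To review an inventory change before running omnia.yml, the two files can be compared group by group. The sketch below is illustrative only and not part of Omnia: parse_inventory and inventory_changes are hypothetical helper names, and the parser assumes the simple layout of the example above (a [group] header followed by one IP per line).

```python
import ipaddress

OLD = """[manager]
10.5.0.101

[compute]
10.5.0.102
10.5.0.103

[login]
10.5.0.104
"""

NEW = """[manager]
10.5.0.102

[compute]
10.5.0.101
10.5.0.103
10.5.0.104
10.5.0.106

[login]
10.5.0.105
"""


def parse_inventory(text):
    """Map each host IP to the group it is listed under."""
    host_to_group = {}
    current = None
    for raw in text.splitlines():
        line = raw.strip()
        if not line:
            continue
        if line.startswith("[") and line.endswith("]"):
            current = line[1:-1]  # new group header
        elif current is not None:
            host_to_group[line] = current
    return host_to_group


def inventory_changes(old_text, new_text):
    """Return (host, old_group, new_group) for every host whose group changed.

    old_group is None for a newly added host; new_group is None for a
    host that was removed.
    """
    old, new = parse_inventory(old_text), parse_inventory(new_text)
    hosts = sorted(set(old) | set(new), key=ipaddress.ip_address)
    return [(h, old.get(h), new.get(h)) for h in hosts if old.get(h) != new.get(h)]


for host, before, after in inventory_changes(OLD, NEW):
    print(host, before, "->", after)
```

For the example inventories, this lists 10.5.0.101, 10.5.0.102 and 10.5.0.104 as moved between groups and 10.5.0.105 and 10.5.0.106 as newly added, while 10.5.0.103 is omitted because its group is unchanged.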

Note

To change the configuration of the cluster, the files input/omnia_config.yml, input/security_config.yml and input/storage_config.yml can be updated before running omnia.yml.

  2. To install security, job scheduler and storage tools (NFS, BeeGFS) on the nodes, run omnia.yml:

    ansible-playbook omnia.yml -i inventory
    
