Omnia

Ansible playbook-based tools for deploying Slurm and Kubernetes clusters for High Performance Computing, Machine Learning, Deep Learning, and High-Performance Data Analytics

This project is maintained by dellhpc

Pre-requisites Before Running Control Plane

` pip3.6 install –upgrade pip <br> pip3.8 install –upgrade pip <br> python3.6 -m pip install ansible <br> python3.8 -m pip install ansible==5.10.0`

After the installation is complete, run ansible --version to verify if the installation is successful. In the output, ensure that the executable location path is present in the PATH variable by running echo $PATH. If executable location path is not present, update the path by running export PATH=$PATH:<executable location>\.

For example,

ansible -- version
ansible 2.10.9
config file = None
configured module search path = ['/root/.ansible/plugins/modules', '/usr/share/ansible/plugins/modules']
ansible python module location = /usr/local/lib/python3.6/site-packages/ansible
executable location = /usr/local/bin/ansible
python version = 3.6.8 (default, Aug 24 2020, 17:57:11) [GCC 8.3.1 20191121 (Red Hat 8.3.1-5)]

The executable location is /usr/local/bin/ansible. Update the path by running the following command:

export PATH=$PATH:/usr/local/bin

Note:

  • To deploy Omnia, Python 3.6 provides bindings to system tools such as RPM, DNF, and SELinux. As versions greater than 3.6 do not provide these bindings to system tools, ensure that you install Python 3.6 with dnf.
  • SELinux will not be disabled and will be in permissive mode when control plane is executing.
  • If Ansible version 2.9 or later is installed, ensure it is uninstalled before installing a newer version of Ansible. Run the following commands to uninstall Ansible before upgrading to newer version.
    1. pip uninstall ansible
    2. pip uninstall ansible-base (if ansible 2.9 is installed)
    3. pip uninstall ansible-core (if ansible 2.10 > version is installed)

Note:

  • After the installation of the Omnia appliance, changing the control plane is not supported. If you need to change the control plane, you must redeploy the entire cluster.
  • If there are errors while executing any of the Ansible playbook commands, then re-run the commands.