Omnia

Ansible playbook-based tools for deploying Slurm and Kubernetes clusters for High Performance Computing, Machine Learning, Deep Learning, and High-Performance Data Analytics

This project is maintained by dellhpc

Omnia (Latin: all or everything) is a deployment tool to configure Dell EMC PowerEdge servers running standard RPM-based Linux OS images into clusters capable of supporting HPC, AI, and data analytics workloads. It uses Slurm, Kubernetes, and other packages to manage jobs and run diverse workloads on the same converged solution. It is a collection of Ansible playbooks, is open source, and is constantly being extended to enable comprehensive workloads.

Current release version

1.3

Previous release version

1.2.2

Blogs about Omnia

What Omnia does

Omnia can deploy and configure devices, and build clusters that use Slurm or Kubernetes (or both!) for workload management. Omnia will install software from a variety of sources, including:

Whenever possible, Omnia will leverage existing projects rather than reinvent the wheel.

Omnia stacks

Omnia can deploy firmware, install Kubernetes or Slurm (or both), along with additional drivers, services, libraries, and user applications. Omnia Kubernetes Stack

Omnia Slurm Stack

What’s new in this release

Using Omnia

  1. Set up your network based on the sample topologies provided here:
  2. Verify that your system meets Omnia’s hardware and software requirements
  3. Ensure that all pre-requisites are met.
  4. Fill out all the required input parameters.
  5. Run Control_Plane to provision OS’s, configure devices and set up security measures.
  6. Run Omnia on target cluster nodes to set up Kubernetes and Slurm.
  7. Run the telemetry playbook to set up and use Telemetry and Visualization Services

Note: For a complete guide to running Omnia via CLI, check this out!

Omnia Flow

Troubleshooting Omnia

Contributing to Omnia

The Omnia project was started to give members of the Dell Technologies HPC Community a way to easily set up clusters of Dell EMC servers, and to contribute useful tools, fixes, and functionality back to the HPC Community.

Open to All

While we started Omnia within the Dell Technologies HPC Community, that doesn’t mean that it’s limited to Dell EMC servers, networking, and storage. This is an open project, and we want to encourage everyone to use and contribute to Omnia!

Anyone can contribute!

It’s not just new features and bug fixes that can be contributed to the Omnia project! Anyone should feel comfortable contributing. We are asking for all types of contributions:

If you would like to contribute, see CONTRIBUTING.