Input parameters for Local Repositories
Input all required values in
input/software_config.json.
Parameter |
Details |
|---|---|
cluster_os_type
Required |
|
cluster_os_version
Required |
Note Specify only the major version for the Default value: |
repo_config
Required |
Note
|
softwares
Required |
Note The accepted names for software is taken from |
Sample version for Ubuntu:
{
"cluster_os_type": "ubuntu",
"cluster_os_version": "22.04",
"repo_config": "partial",
"softwares": [
{"name": "amdgpu", "version": "6.3.1"},
{"name": "cuda", "version": "12.8.0"},
{"name": "bcm_roce", "version": "232.1.133.2"},
{"name": "ofed", "version": "24.01-0.3.3.1"},
{"name": "openldap"},
{"name": "secure_login_node"},
{"name": "nfs"},
{"name": "beegfs", "version": "7.4.5"},
{"name": "k8s", "version":"1.31.4"},
{"name": "roce_plugin"},
{"name": "jupyter"},
{"name": "kubeflow"},
{"name": "kserve"},
{"name": "pytorch"},
{"name": "tensorflow"},
{"name": "vllm"},
{"name": "telemetry"},
{"name": "ucx", "version": "1.15.0"},
{"name": "openmpi", "version": "4.1.6"},
{"name": "intelgaudi", "version": "1.19.2-32"},
{"name": "csi_driver_powerscale", "version":"v2.13.0"}
{"name": "intel_benchmarks", "version": "2024.1"},
{"name": "amd_benchmarks"}
],
"bcm_roce": [
{"name": "bcm_roce_libraries", "version": "232.1.133.2"}
],
"amdgpu": [
{"name": "rocm", "version": "6.3.1" }
],
"intelgaudi": [
{"name": "intel"}
],
"vllm": [
{"name": "vllm_amd"},
{"name": "vllm_nvidia"}
],
"pytorch": [
{"name": "pytorch_cpu"},
{"name": "pytorch_amd"},
{"name": "pytorch_nvidia"},
{"name": "pytorch_gaudi"}
],
"tensorflow": [
{"name": "tensorflow_cpu"},
{"name": "tensorflow_amd"},
{"name": "tensorflow_nvidia"}
]
}
For a list of accepted values in softwares, go to input/config/<cluster_os_type>/<cluster_os_version> and view the list of JSON files available. The filenames present in this location (without the * .json extension) are a list of accepted software names. The repositories to be downloaded for each software are listed the corresponding JSON file. For example, for a cluster running Ubuntu 22.04, go to input/config/ubuntu/22.04/ and view the file list:
amdgpu.json
bcm_roce.json
beegfs.json
cuda.json
jupyter.json
k8s.json
kserve.json
kubeflow.json
roce_plugin.json
nfs.json
ofed.json
openldap.json
pytorch.json
tensorflow.json
vllm.json
intelgaudi.json
For a list of repositories (and their types) configured for AMD GPUs, view the amdgpu.json file:
{
"amdgpu": {
"cluster": [
{"package": "linux-headers-$(uname -r)", "type": "deb", "repo_name": "jammy"},
{"package": "linux-modules-extra-$(uname -r)", "type": "deb", "repo_name": "jammy"},
{"package": "amdgpu-dkms", "type": "deb", "repo_name": "amdgpu"}
]
},
"rocm": {
"cluster": [
{"package": "rocm", "type": "deb", "repo_name": "rocm"},
{"package": "rocm-validation-suite", "type": "deb", "repo_name": "rocm"}
]
}
}
Note
To configure a locally available repository that does not have a pre-defined json file, click here.
Input the required values in
input/local_repo_config.yml.
Parameter |
Details |
|---|---|
repo_store_path
Required |
Default value: |
user_repo_url
Optional |
|
user_registry
Optional |
|
ubuntu_os_url
Required |
|
omnia_repo_url_ubuntu
Required |
Default value: - { url: "https://download.docker.com/linux/ubuntu {{ os_release }} stable", gpgkey: "https://download.docker.com/linux/ubuntu/gpg" }
- { url: "https://repo.radeon.com/rocm/apt/{{ rocm_version }} {{ os_release }} main", gpgkey: "https://repo.radeon.com/rocm/rocm.gpg.key" }
- { url: "https://www.beegfs.io/release/beegfs_{{beegfs_version}} {{ os_release }} non-free", gpgkey: "https://www.beegfs.io/release/beegfs_{{beegfs_version}}/gpg/GPG-KEY-beegfs" }
- { url: "https://repo.radeon.com/amdgpu/{{ amdgpu_version }}/ubuntu {{ os_release }} main", gpgkey: "https://repo.radeon.com/rocm/rocm.gpg.key" }
- { url: "https://ltb-project.org/debian/openldap25/jammy jammy main", publickey: "https://ltb-project.org/documentation/_static/RPM-GPG-KEY-LTB-project" }
- { url: "https://nvidia.github.io/libnvidia-container/stable/deb/amd64 /", gpgkey: "https://nvidia.github.io/libnvidia-container/gpgkey" }
- { url: "http://ppa.launchpad.net/deadsnakes/ppa/ubuntu {{ os_release }} main", gpgkey: "" }
- { url: "https://a2o.github.io/snoopy-packages/repo/ubuntu {{ os_release }} stable", publickey: "https://a2o.github.io/snoopy-packages/snoopy-packages-key.pub" }
- { url: "https://vault.habana.ai/artifactory/debian {{ os_release }} main", publickey: "https://vault.habana.ai/artifactory/api/gpg/key/public" }
|
Input
docker_usernameanddocker_passwordininput/provision_config_credentials.ymlto avoid image pullback errors.
If you have any feedback about Omnia documentation, please reach out at omnia.readme@dell.com.