vLLM enablement for AMD MI300 GPU

Note

This whole execution will take approximately 3-4 hours.

Follow the below steps to setup the vLLM:

Build vLLM
- Update the admin-nic-IP in the vllm_k8s_config.yml file located inside the omnia/utils/vllm_build directory.
- Run the vllm_build.yml playbook using:
  ansible-playbook vllm_build.yml
Verify vLLM

Once the playbook is executed, run the following command to verify whether vLLM image generation was successful.

nerdctl images | grep vllm

Update “package” and “tag” details in the vllm.json file located at omnia/tools/input/config/ubuntu/<22.04 or 24.04>/vllm.json, as shown below.

"vllm_amd": {

    "cluster": [

      {

        "package": "vllm-rocm",

        "tag": "latest",

        "type": "image"

      }

    ]

  }

Finally, deploy the latest vllm using the vllm.yml playbook located at omnia/tools/vllm.yml. Use the following command:

ansible-playbook vllm.yml -i inv.ini

A sample inventory is attached below:

inv.ini

[kube_node]

10.5.x.a

10.5.x.b

If you have any feedback about Omnia documentation, please reach out at omnia.readme@dell.com.