vLLM enablement for AMD MI300 GPU

Note

This whole execution will take approximately 3-4 hours.

  • MI300 support is enabled with vllm version 0.3.2

  • The vllm_build.yml file is located inside omnia/utils/vllm_build.

Follow the below steps to setup the vLLM:

  1. Build vLLM

    • Update the admin-nic-IP in the vllm_k8s_config.yml file located inside the omnia/utils/vllm_build directory.

    • Run the vllm_build.yml playbook using:

      ansible-playbook vllm_build.yml
      
  2. Verify vLLM

Once the playbook is executed, run the following command to verify whether vLLM image generation was successful.

nerdctl images | grep vllm
  1. Update “package” and “tag” details in the vllm.json file located at omnia/tools/input/config/ubuntu/22.04/vllm.json, as shown below.

"vllm_amd": {



    "cluster": [

      {

        "package": "vllm-rocm",

        "tag": "latest",

        "type": "image"

      }

    ]



  }
  1. Finally, deploy the latest vllm using the vllm.yml playbook located at omnia/tools/vllm.yml. Use the following command:

ansible-playbook vllm.yml -i inv.ini

A sample inventory is attached below:

inv.ini

[kube_node]

10.5.x.a

10.5.x.b

If you have any feedback about Omnia documentation, please reach out at omnia.readme@dell.com.