vLLM enablement for AMD MI300 GPU
Note
This whole execution will take approximately 3-4 hours.
MI300 support is enabled with vLLM version 0.3.2.
The vllm_build.yml file is located inside omnia/utils/vllm_build.
Follow the steps below to set up vLLM:
Build vLLM
Update the admin-nic-IP in the vllm_k8s_config.yml file located inside the omnia/utils/vllm_build directory.
Run the vllm_build.yml playbook using:
ansible-playbook vllm_build.yml
Verify vLLM
Once the playbook has finished, run the following command to verify that the vLLM image was generated successfully:
nerdctl images | grep vllm
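For scripted checks, the same verification could be wrapped in a small Python helper. This is only a sketch: the function names has_vllm_image and list_images are hypothetical, and the check simply mirrors the grep above by looking for the substring "vllm" in each line of the listing.

```python
import subprocess


def has_vllm_image(listing: str) -> bool:
    """Return True if any line of an image listing contains 'vllm'.

    Case-sensitive substring match, the same test that `grep vllm` applies.
    """
    return any("vllm" in line for line in listing.splitlines())


def list_images() -> str:
    """Run `nerdctl images` and return its standard output as text.

    Raises CalledProcessError if nerdctl exits with a non-zero status.
    """
    result = subprocess.run(
        ["nerdctl", "images"], capture_output=True, text=True, check=True
    )
    return result.stdout
```

On a node where the build ran, has_vllm_image(list_images()) would be expected to return True once the image exists.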
Update the “package” and “tag” details in the vllm.json file located at omnia/tools/input/config/ubuntu/<22.04 or 24.04>/vllm.json, as shown below.
"vllm_amd": {
"cluster": [
{
"package": "vllm-rocm",
"tag": "latest",
"type": "image"
}
]
}
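If the edit needs to be scripted rather than made by hand, a minimal Python sketch along these lines might do it. It assumes only the fragment shown above; the real vllm.json may contain other keys, which this helper leaves untouched. The name set_vllm_image and the "old-name"/"old-tag" starting values are hypothetical.

```python
import json


def set_vllm_image(config: dict, package: str, tag: str) -> dict:
    """Set "package" and "tag" on every image entry under vllm_amd.cluster."""
    for entry in config["vllm_amd"]["cluster"]:
        # Only touch entries declared as container images.
        if entry.get("type") == "image":
            entry["package"] = package
            entry["tag"] = tag
    return config


# Stand-in for the parsed file; the starting values are placeholders.
fragment = json.loads(
    '{"vllm_amd": {"cluster": '
    '[{"package": "old-name", "tag": "old-tag", "type": "image"}]}}'
)
updated = set_vllm_image(fragment, "vllm-rocm", "latest")
print(json.dumps(updated, indent=2))
```

To apply this to the actual file, load it with json.load, call the helper, and write the result back with json.dump.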
Finally, deploy the latest vLLM using the vllm.yml playbook located at omnia/tools/vllm.yml. Use the following command:
ansible-playbook vllm.yml -i inv.ini
A sample inventory is attached below:
inv.ini
[kube_node]
10.5.x.a
10.5.x.b
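Before running the playbook, it can help to confirm that the inventory actually lists hosts under [kube_node]. The sketch below reads the group with Python's configparser. It assumes bare host lines like the sample above; Ansible inventories that attach variables to a host line would not parse cleanly this way. The function name kube_nodes is hypothetical.

```python
import configparser
from typing import List


def kube_nodes(inventory_text: str) -> List[str]:
    """Return the hosts listed under [kube_node] in an INI-style inventory."""
    parser = configparser.ConfigParser(allow_no_value=True, interpolation=None)
    parser.optionxform = str  # keep host names exactly as written
    parser.read_string(inventory_text)
    if not parser.has_section("kube_node"):
        return []
    return list(parser["kube_node"].keys())


sample_inventory = "[kube_node]\n10.5.x.a\n10.5.x.b\n"
print(kube_nodes(sample_inventory))
```

An empty list here would suggest that the group name is misspelled or that no hosts are defined.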
If you have any feedback about Omnia documentation, please reach out at omnia.readme@dell.com.