vLLM enablement for AMD MI300 GPU ---------------------------------- .. note:: This whole execution will take approximately 3-4 hours. * MI300 support is enabled with vllm version 0.3.2 * The ``vllm_build.yml`` file is located inside ``omnia/utils/vllm_build``. Follow the below steps to setup the vLLM: 1. **Build vLLM** * Update the ``admin-nic-IP`` in the ``vllm_k8s_config.yml`` file located inside the ``omnia/utils/vllm_build`` directory. * Run the ``vllm_build.yml`` playbook using: :: ansible-playbook vllm_build.yml 2. **Verify vLLM** Once the playbook is executed, run the following command to verify whether vLLM image generation was successful. :: nerdctl images | grep vllm 3. Update "package" and "tag" details in the ``vllm.json`` file located at ``omnia/tools/input/config/ubuntu/<22.04 or 24.04>/vllm.json``, as shown below. :: "vllm_amd": { "cluster": [ { "package": "vllm-rocm", "tag": "latest", "type": "image" } ] } 4. Finally, deploy the latest vllm using the ``vllm.yml`` playbook located at ``omnia/tools/vllm.yml``. Use the following command: :: ansible-playbook vllm.yml -i inv.ini A sample inventory is attached below: :: inv.ini [kube_node] 10.5.x.a 10.5.x.b