Alternate method to install the AMD ROCm platform

The accelerator role allows users to set up the AMD ROCm platform, whose tools unlock the potential of installed AMD GPUs.

Prerequisites

  • The ROCm local repositories must be configured using the local_repo.yml script.

  • The input/software_config.json file must contain valid amdgpu and rocm versions. See input parameters for more information.
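Before running the playbook, it can help to confirm that both entries are present in the configuration file. The following is a minimal sketch, not part of Omnia: the helper names config_has_entry and check_rocm_config are hypothetical, and the check assumes each software entry is written in the form "name": "<software>". It is a plain text match and does not validate the version values.

```shell
# Hypothetical helper: succeed if FILE contains a "name": "<ENTRY>" pair.
# Assumption: software entries are written as "name": "<software>".
# This is a text match only; it does not parse JSON or check versions.
config_has_entry() {
    entry="$1"
    file="$2"
    grep -Eq "\"name\"[[:space:]]*:[[:space:]]*\"${entry}\"" "$file"
}

# Hypothetical helper: report whether amdgpu and rocm both appear in FILE.
check_rocm_config() {
    cfg="$1"
    for sw in amdgpu rocm; do
        if config_has_entry "$sw" "$cfg"; then
            echo "$sw: found"
        else
            echo "$sw: missing"
        fi
    done
}
```

For example, check_rocm_config input/software_config.json prints one line per entry, so a missing amdgpu or rocm entry is visible before the playbook starts.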

Playbook configurations

The following configurations take place when the accelerator.yml playbook runs:

  1. Servers with AMD GPUs are identified and the latest GPU drivers and ROCm platforms are downloaded and installed.

  2. Servers with no GPU are skipped.
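To preview which servers are likely to be picked up, one option is to inspect the PCI device list on each node. The sketch below is an assumption about how such a check could look, not a description of how the playbook identifies GPUs. It relies on `lspci -nn`, which prints the numeric class and vendor IDs, and treats any display-class device (class 03xx) with the AMD/ATI vendor ID 1002 as an AMD GPU. The function names are hypothetical.

```shell
# Hypothetical helper: read `lspci -nn` output on stdin and print only the
# lines that look like AMD GPUs, i.e. PCI class 03xx (display controllers)
# with vendor ID 1002 (AMD/ATI).
filter_amd_gpus() {
    grep -E '\[03[0-9a-f]{2}\]:.*\[1002:[0-9a-f]{4}\]'
}

# Hypothetical helper: report whether the local node has an AMD GPU.
# Requires the lspci utility (pciutils) on the node.
node_has_amd_gpu() {
    if lspci -nn | filter_amd_gpus >/dev/null; then
        echo "AMD GPU detected"
    else
        echo "no AMD GPU detected"
    fi
}
```

Running node_has_amd_gpu on a server gives a quick indication of whether the node has a device matching this pattern.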

Executing the playbook

To install all the latest GPU drivers and toolkits, run:

cd accelerator
ansible-playbook accelerator.yml -i inventory

User permissions for ROCm platforms

  • To add a user to the render and video groups, use the following command:

    sudo usermod -a -G render,video <user>
    

Note

  • <user> is the system name of the end user.

  • This command must be run with root permissions.

  • To give other users access on their individual GPU nodes, the root user must run the previous command separately for each user on each node.

  • To enable users to use ROCm tools, use the following command, as shown in the sample image below:

    /opt/rocm/bin/<ROCm command>
    
[Figure: ROCm user permissions sample (ROCm_user_permissions.png)]
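After running the usermod command, it can be useful to confirm that a user actually belongs to both groups. The following is a minimal sketch with a hypothetical helper, missing_gpu_groups, which takes a space-separated list of group names (the format printed by `id -nG`) and prints whichever of render and video is absent.

```shell
# Hypothetical helper: given a space-separated list of group names,
# print the GPU-related groups (render, video) that are not in the list.
# Prints an empty line when both groups are present.
missing_gpu_groups() {
    groups_list=" $1 "      # pad with spaces so whole names are matched
    missing=""
    for g in render video; do
        case "$groups_list" in
            *" $g "*) ;;                                # group present
            *) missing="${missing:+$missing }$g" ;;     # group absent
        esac
    done
    echo "$missing"
}
```

For example, missing_gpu_groups "$(id -nG <user>)" prints nothing but an empty line once the user is in both groups. New group membership generally applies only to sessions started after the change, so the user may need to log out and back in before ROCm tools can access the GPU.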

For any configuration changes, refer to the official ROCm documentation.

If you have any feedback about Omnia documentation, please reach out at omnia.readme@dell.com.