Alternate method to install the AMD ROCm platform
The accelerator role allows users to set up the AMD ROCm platform. This tools allow users to unlock the potential of installed AMD GPUs.
Prerequisites
The ROCm local repositories must be configured using the local_repo.yml script.
The
input/software_config.jsonmust contain validamdgpuandrocmversion. See input parameters for more information.
Playbook configurations
The following configurations takes place while running the accelerator.yml playbook:
Servers with AMD GPUs are identified and the latest GPU drivers and ROCm platforms are downloaded and installed.
Servers with no GPU are skipped.
Executing the playbook
To install all the latest GPU drivers and toolkits, run:
cd accelerator
ansible-playbook accelerator.yml -i inventory
User permissions for ROCm platforms
To add an user to the
renderandvideogroup, use the following command:sudo usermod -a -G render,video <user>
Note
<user> is the system name of the end user.
This command must be run with
rootpermissions.If the root user wants to provide access to other users and their individual GPU nodes, the previous command needs to be run on all of them separately.
To enable users to use ROCm tools, use the following command as shown in the below added sample file:
/opt/rocm/bin/<ROCm command>
For any configuration changes, check out ROCm’s official documentation here.
If you have any feedback about Omnia documentation, please reach out at omnia.readme@dell.com.