fix(ui): enable rocm-smi support by correcting flags and parsing #580

Soddentrough · 2025-12-07T09:23:54Z

ai-toolkit runs on systems with AMD GPUs but displays an error about 'nvidia-smi' in the dashboard when doing so.

This patch removes the hard-coded dependency on 'nvidia-smi' allowing ai-toolkit to operate with either 'nvidia-smi' or 'rocm-smi'. It first checks for 'nvidia-smi' and then checks for 'rocm-smi' which may cause an issue if both are installed but it solves a need today.

dkspwndj · 2025-12-07T11:17:40Z

Thank you for proceeding with the modification!! But it's not running properly.. I look forward to seeing good results in the future!!

Soddentrough · 2025-12-07T22:22:25Z

Thinking this might be a path issue (where is your rocm-smi installed?) I will update the logic to check for rocm-smi in this order:

which rocm-smi
2, Check for $ROCM_PATH/bin/rocm-smi
Check for /usr/bin/rocm-smi
Check for /opt/rocm/bin/rocm-smi

Soddentrough · 2025-12-08T06:22:50Z

There was a parsing issue when handling the JSON output of rocm-smi creating a phantom device.

dkspwndj · 2025-12-08T14:31:50Z

Forgive me.. I've been laying down the ROCm to the extent that I need the CompyUI with Zluda and then I figured out if the ROCm was properly laid. Now I'll make a separate PyTorch 3.12 folder to lay the ROCm and try it out there..

dkspwndj · 2025-12-08T14:51:41Z

Now tested ROCm 7.1.1 installed venv.
But.. not work well..

Soddentrough · 2025-12-08T20:32:24Z

I'm sorry I didn't notice you're testing with a Windows system. rocm-smi is only available on Linux or WSL. Might be able to use hipinfo.exe on Windows to enumerate the devices but I don't think that has dynamic performance statistics for power/utilization/mem, so stats would show "0".

I don't currently have a way of testing this though and I think for Windows maybe using "Get-Counter" for dynamic performance counters could be the way to go.

… GTT support

Soddentrough · 2025-12-08T22:14:36Z

This now uses amd-smi by default with fallback to rocm-smi. And where amd-smi doesn't fully support a GPU (eg: Strix iGPU) we use the sysfs hwmon metrics. This also allows us to show "VRAM" and "GTT" (shared memory) used by an APU.

fix(ui): enable rocm-smi support by correcting flags and parsing

29cc2fc

Soddentrough added 2 commits December 8, 2025 09:24

Fix rocm-smi detection and improve error messages

ab97aea

Fix ghost device in rocm-smi by filtering system object

2e3e3ef

Add prelim hipinfo.exe support for Windows AMD GPUs

ead7a66

feature: Improve AMD GPU monitoring with amd-smi, sysfs fallback, and…

86823d7

… GTT support

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

fix(ui): enable rocm-smi support by correcting flags and parsing #580

fix(ui): enable rocm-smi support by correcting flags and parsing #580

Soddentrough commented Dec 7, 2025

Uh oh!

dkspwndj commented Dec 7, 2025

Uh oh!

Soddentrough commented Dec 7, 2025

Uh oh!

Soddentrough commented Dec 8, 2025

Uh oh!

dkspwndj commented Dec 8, 2025

Uh oh!

dkspwndj commented Dec 8, 2025

Uh oh!

Soddentrough commented Dec 8, 2025 •

edited

Loading

Uh oh!

Soddentrough commented Dec 8, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

fix(ui): enable rocm-smi support by correcting flags and parsing #580

Are you sure you want to change the base?

fix(ui): enable rocm-smi support by correcting flags and parsing #580

Conversation

Soddentrough commented Dec 7, 2025

Uh oh!

dkspwndj commented Dec 7, 2025

Uh oh!

Soddentrough commented Dec 7, 2025

Uh oh!

Soddentrough commented Dec 8, 2025

Uh oh!

dkspwndj commented Dec 8, 2025

Uh oh!

dkspwndj commented Dec 8, 2025

Uh oh!

Soddentrough commented Dec 8, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Soddentrough commented Dec 8, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Soddentrough commented Dec 8, 2025 •

edited

Loading