mirror of
https://github.com/XuehaiPan/nvitop.git
synced 2026-05-15 06:06:12 -06:00
[PR #208] Fix incorrect memory reporting on coherent UMA platforms (GB10 / DGX … #212
Labels
No labels
api
bug
bug
cli / tui
dependencies
documentation
documentation
documentation
duplicate
enhancement
exporter
invalid
pull-request
pynvml
question
question
upstream
wontfix
No milestone
No project
No assignees
1 participant
Notifications
Due date
No due date set.
Dependencies
No dependencies set.
Reference: github-starred/nvitop#212
Loading…
Add table
Add a link
Reference in a new issue
No description provided.
Delete branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
📋 Pull Request Information
Original PR: https://github.com/XuehaiPan/nvitop/pull/208
Author: @parallelArchitect
Created: 4/16/2026
Status: 🔄 Open
Base:
main← Head:fix/gb10-coherent-uma-memory-reporting📝 Commits (2)
de09aebFix incorrect memory reporting on coherent UMA platforms (GB10 / DGX Spark)2ca5797fix: replace UMA acronym in comment to pass spell check📊 Changes
1 file changed (+15 additions, -7 deletions)
View changed files
📝
nvitop/api/device.py(+15 -7)📄 Description
Fix incorrect memory reporting on coherent UMA platforms (GB10 / DGX Spark)
On GB10 / DGX Spark,
nvmlDeviceGetMemoryInforeturnsNVML_SUCCESSwithtotalequal to systemMemTotal(~121GB). This causes nvitop to display full system RAM as GPU memory instead of actually allocatable memory.The existing
NVMLError_NotSupportedpath correctly handles some UMA platforms, but GB10 returnsNVML_SUCCESS— notNOT_SUPPORTED— so it falls through to the discrete GPU path and displays wrong values.Issue Type
Description
Detect coherent UMA by comparing NVML-reported
totalagainst system virtual memory total. If total >= 90% of system RAM, classify as unified memory and use system virtual memory (MemAvailable) for display instead.Preserves existing behavior for discrete GPUs.
Motivation and Context
Same root cause documented and fixed in:
Note
Requires validation on GB10 / DGX Spark hardware. The fix has not been independently validated on a coherent UMA system.
🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.