mirror of
https://github.com/XuehaiPan/nvitop.git
synced 2026-05-15 14:15:55 -06:00
[GH-ISSUE #90] [BUG] nvidia-ml-py-12.535.77 兼容性問題 #52
Labels
No labels
api
bug
bug
cli / tui
dependencies
documentation
documentation
documentation
duplicate
enhancement
exporter
invalid
pull-request
pynvml
question
question
upstream
wontfix
No milestone
No project
No assignees
1 participant
Notifications
Due date
No due date set.
Dependencies
No dependencies set.
Reference: github-starred/nvitop#52
Loading…
Add table
Add a link
Reference in a new issue
No description provided.
Delete branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Originally created by @hui-zhao-1 on GitHub (Aug 17, 2023).
Original GitHub issue: https://github.com/XuehaiPan/nvitop/issues/90
Originally assigned to: @XuehaiPan on GitHub.
Required prerequisites
What version of nvitop are you using?
1.2.0
Operating system and version
Ubuntu 20.04.6 LTS (Focal Fossa)
NVIDIA driver version
535.86.10
NVIDIA-SMI
Python environment
3.8.17 (default, Jul 5 2023, 21:04:15)
[GCC 11.2.0] linux
nvidia-ml-py==12.535.77
nvitop==1.2.0
Problem description
测试机器使用的 nvidia driver https://international.download.nvidia.com/tesla/535.86.10/NVIDIA-Linux-x86_64-535.86.10.run


在该 版本 driver 下,运行 nvitop 无法正常看到运行中的进程:
Steps to Reproduce
排查发现,nvitop 报了这个错误:

Traceback
Logs
No response
Expected behavior
No response
Additional context
No response
@XuehaiPan commented on GitHub (Aug 17, 2023):
Duplicate #88, would be fixed by #89.
@hui-zhao-1 commented on GitHub (Aug 17, 2023):
排查日志,怀疑是 https://github.com/XuehaiPan/nvitop/blob/main/nvitop/api/libnvml.py line 590 的 __determine_get_running_processes_version_suffix() 这个方法有问题
我这边没有看懂这里为什么要通过 'nvmlDeviceGetConfComputeMemSizeInfo' 判断 版本号,所以 fork 代码把这个判断注释以后,解决了这个问题,参考:
cc3ad6da51@hui-zhao-1 commented on GitHub (Aug 17, 2023):
已经通过 pip3 install git+https://github.com/XuehaiPan/nvitop.git 验证 问题解决了