Request: GPU Memory Junction Temperature via nvidia-smi or NVML API

If you would like more insight into the NVML throttling, please start a new topic and, if possible, ask any specific questions you have in mind. That helps me get responses from the right folks.
Thanks so much!

Exactly!

2 Likes

So no MEM temperature under Linux?
This is a F-ing joke, NVIDIA…

4 Likes

+1, this is exactly what is needed. However, they are not doing it for some reason…

4 Likes

@nadeemm We don’t care about NVML, we care about TjMax being visible in nvidia-smi.
Get it through your head already, it’s not difficult…

5 Likes

I don’t think you are grasping the issue:
Running “nvidia-smi dmon” works in Windows.
In Linux it shows mtemp empty.

5 Likes

I don’t think you are grasping the issue:
Running “nvidia-smi dmon” works in Windows.
In Linux it shows mtemp empty. Try it, please.
Really, many of us have been checking in on this frequently since February.

6 Likes

Hi NVIDIA.
Not sure if you are understanding the situation. It’s pretty simple.
The VRAM junction temperature, the hottest part of the 3090, needs to be monitored.
Surely, being that you have tested your product prior to shipping (right?), you are acutely aware of this.
I won’t comment on your usage of poor thermal pads. Nope, won’t do it.

HWiNFO can do this on Windows, despite not being totally accurate.
We are perfectly happy with it not being perfectly accurate with a dedicated sensor. It’s far more intel than otherwise provided (none).
No one anywhere in this thread ever mentioned or cared about case temp. That is a derivation to the unimaginable n’th degree.
Implement an NVAPI update to provide said measurement on Linux. Simple. Thanks.

5 Likes

Go for AMD, they don’t have problems with Linux. Bye.

3 Likes

Hi,
We’re not asking for the case temperature, as people have pointed out.
Here’s the output of nvidia-smi dmon -s pucvme.
Note the mtemp column is empty on Linux. On Windows it displays the memory junction temperature.
That is the temperature we are after.

bertha ➜ ~ nvidia-smi dmon -s pucvme

# gpu   pwr gtemp mtemp    sm   mem   enc   dec  mclk  pclk pviol tviol    fb  bar1 sbecc dbecc   pci
# Idx     W     C     C     %     %     %     %   MHz   MHz     %  bool    MB    MB  errs  errs  errs
    0   229    62     -   100   100     0     0 10451  1575     0     0  4845     9     -     -     0
    0   229    62     -   100   100     0     0 10451  1560   100     0  4845     9     -     -     0
    0   229    62     -   100   100     0     0 10451  1560   100     0  4845     9     -     -     0
    0   229    62     -   100   100     0     0 10451  1560   100     0  4845     9     -     -     0
    0   229    62     -   100   100     0     0 10451  1560   100     0  4845     9     -     -     0
    0   229    62     -   100   100     0     0 10451  1560   100     0  4845     9     -     -     0
    0   229    62     -   100   100     0     0 10451  1560   100     0  4845     9     -     -     0
    0   229    62     -   100   100     0     0 10451  1560   100     0  4845     9     -     -     0
12 Likes
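
For anyone who wants to check this from a script, here is a minimal Python sketch that parses dmon-style output like the listing above and reports whether mtemp is populated. The function names (parse_dmon, memory_temps, sample_dmon) are made up for illustration, and the parser assumes the layout shown in the post: a first "#" line with column names, a second "#" line with units, then whitespace-separated rows where "-" means no reading.

```python
import subprocess


def parse_dmon(text):
    """Parse `nvidia-smi dmon` text into a list of {column: value} dicts."""
    columns = None
    rows = []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith("#"):
            # First '#' line holds the column names; later '#' lines
            # (units, repeated headers) are skipped.
            if columns is None:
                columns = line.lstrip("#").split()
            continue
        fields = line.split()
        # Only keep rows that match the header width (assumed layout).
        if columns and len(fields) == len(columns):
            rows.append(dict(zip(columns, fields)))
    return rows


def memory_temps(rows):
    """Return mtemp per sample as an int, or None where dmon printed '-'."""
    temps = []
    for row in rows:
        value = row.get("mtemp", "-")
        temps.append(int(value) if value.isdigit() else None)
    return temps


def sample_dmon(count=1):
    """Run dmon with the same metric groups as in the post (needs nvidia-smi)."""
    result = subprocess.run(
        ["nvidia-smi", "dmon", "-s", "pucvme", "-c", str(count)],
        capture_output=True, text=True, check=True,
    )
    return result.stdout
```

On a machine with the driver installed, memory_temps(parse_dmon(sample_dmon())) gives one entry per sample. A None entry corresponds to the "-" in the mtemp column that this thread is about.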

I really want to see the vram temperature values of the video cards :/

3 Likes

@2024a still no news?

3 Likes

I think we’re beating a dead horse here!

3 Likes

Maybe beating it daily will help getting it up again.

2 Likes

+1 f i x   m e m   t e m p   o n   l i n u x !

3 Likes

We really only care about the TjMax value!!!

These cards are so expensive and should have appropriate support!

Please add an engineer to this task.

Thanks

6 Likes

I really want to weigh in on the need for proper sensor data on Linux. Please help us with this!

3 Likes

Want to be another voice here. Still waiting. Either being able to see VRAM temps or being able to set the maximum VRAM temp (before throttling) would be helpful! VRAM temperatures are widely known to be an issue for gamers and creatives alike on 3080/90. Thanks.

2 Likes

@wpierce @nadeemm

Why is it so difficult to understand the issue? It is so easy to replicate on Linux:

nvidia-smi dmon

More than 6 months and no real solution here. It is really annoying, especially for those of us who invested thousands of dollars in your products.

Why not switch to AMD in the near future?

Thanks…

5 Likes

I’m glad to have found this thread, and I hope NVIDIA hears its customers.

I agree with everyone else that NVIDIA should enable VRAM temperature monitoring!!