Measure GPU temperature in Linux ?

I’d like to know if it is possible to measure GPU temperature in Linux. (Reading GPU core temperature reading)
I’ve seen previous posts about running “nvidia-settings” and look at the GPU temperature.

  1. Is there any way to check the temperature when x-windows is started (setting /etc/inittab runlevel 3)
  2. Is there any way to dump out the temperature readings to a separate text file ?

Thanks

The nvidia-settings control panel shows the gpu core temperature, so does the third-part nvclock utility. There isn’t a sensor logging package that I know of with gpu capabilities, but nvclock and about 10 lines of bash script should take care of it.

Thanks for the reply

Could you help me a little bit more on this ?

Is nvclock executable under non-graphics environment ?

Yes and no. nvclock is a command line utility, so you can run it from any shell, but it seems that whatever interface it uses only loads when the X11 modules are loaded and X11 is running (or at least that is the case with the hardware I have access to).

Hi

I faced the same issue and wrote a small script which does the job. I have written it up in my blog

http://hobiger.org/blog/2008/12/22/logging…re-of-your-gpu/

Maybe that’s of some help for you.

Cheers.

Hello !

You can also use lm-sensors:
http://ubuntuforums.org/showthread.php?t=2780

Best regards.

I’m using CentOS 5.3 with 2.6.18-164.el5 kernel.

Right after the initial installation of the 190.18 driver for nvidia, I could see the thermal monitor readings from the nvidia-settings.

But after a reboot, now I can’t see the thermal monitor readings from the nvidia-settings panel. When I run nvidia-smi -lsa to probe the GPU core temperature, I get the following message. Rebooting or re-installing the cuda driver didn’t work.

Does anyone have any idea on how to troubleshoot this problem?

$ nvidia-smi -lsa

==============NVSMI LOG==============

Timestamp					   : Tue Oct  6 13:42:26 2009

GPU 0:

		Product Name			: GeForce GTX 285

		PCI ID				  : 5e310de

Failed to read GPU temperature!

		Temperature			 : 0 C

GPU 1:

		Product Name			: GeForce GTX 285

		PCI ID				  : 5e310de

Failed to read GPU temperature!

		Temperature			 : 0 C

GPU 2:

		Product Name			: GeForce GTX 285

		PCI ID				  : 5e310de

Failed to read GPU temperature!

		Temperature			 : 0 C