What does this 'soctherm: trip temperature' message mean?

Hi,

When I run my C code I get the message:

tegra_soctherm 700e2000.soctherm: soctherm: trip temperature -2147483647 forced to -127000

I know this is the result of checking and enforcing the minimum and maximum of the temperature range, and that the values are in millidegrees Celsius. But is it normal, and why does it happen?

Is there any way to avoid it?

Thanks

Hi acessonegato,

Do you see the message on Jetson TX1 Developer Kit? If yes, how do you get the message?

Hi,

Yes, on 28.1. I got the message when I ran my C program. It appeared four times, I guess once for each core.

Searching on Google, I found that many people whose problems are unrelated to this message also have it in their kernel log, so I guess it is normal. However, it would be nice if Nvidia could investigate, since it does not make any sense.

It would also be nice if Nvidia could include in the documentation the dmesg output of a normally working TX1, so that one can compare and see which messages can be ignored.

Also in the kernel log I got:

pci 0000:00:01.0: bridge configuration invalid ([bus 00-00]), reconfiguring

fast_enable_write: Failed to get cpuidle driver

vi 54080000.vi: vi_probe: failed
vi: probe of 54080000.vi failed with error -22

xhci-tegra 70090000.xusb: can’t get usb3-1 phy (-517)

tegra_soctherm 700e2000.soctherm: throttrip: pll: missing hot temperature
tegra_soctherm 700e2000.soctherm: failed to register sensor: -19

gk20a 57000000.gpu: platform probe failed

and some others that seem to be common.

However everything seems to work.

Thanks.

Thanks for your suggestion. Could you provide a simple C program that reproduces the messages? Then we can quickly look into it.

Hi,

That happened only once, not always, but as I said, it is a common message in the kernel log that everyone has. Most probably it is a bug in soctherm.

Could you help me with the other messages?

Thanks,

What are the other messages? Do you mean the ones below? Are they sequential messages? If they are separate issues, please file separate topics for tracking. Thanks!

See the code at: linux/soctherm.c at master · torvalds/linux · GitHub

The message is printed by a temperature range check and indicates that the requested trip temperature is outside the allowed limits.
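
To illustrate the kind of check that could produce this line, here is a minimal sketch in C. It is not the actual driver source: the function names `clamp_int` and `enforce_temp_range_sketch` are hypothetical, and the limits of -127000 and 127000 millidegrees Celsius are assumptions inferred from the log line above. Note that -2147483647 equals -INT_MAX for a 32-bit int, which suggests the caller passed a "no lower limit" sentinel rather than a real temperature, though that reading is also an assumption.

```c
#include <assert.h>
#include <limits.h>
#include <stdio.h>

/* Assumed limits in millidegrees Celsius, inferred from the log line. */
#define MIN_TEMP_MC (-127000)
#define MAX_TEMP_MC 127000

/* Hypothetical helper: clamp value into the closed range [lo, hi]. */
static int clamp_int(int value, int lo, int hi)
{
    assert(lo <= hi);
    if (value < lo)
        return lo;
    if (value > hi)
        return hi;
    return value;
}

/*
 * Hypothetical stand-in for the driver's range check: returns the
 * clamped trip temperature and reports when the value was changed.
 */
static int enforce_temp_range_sketch(int trip_temp_mc)
{
    int temp = clamp_int(trip_temp_mc, MIN_TEMP_MC, MAX_TEMP_MC);

    if (temp != trip_temp_mc)
        printf("trip temperature %d forced to %d\n", trip_temp_mc, temp);
    return temp;
}
```

Calling `enforce_temp_range_sketch(-INT_MAX)` returns the assumed lower limit and prints a message of the same shape as the one in the log, while an in-range value such as 45000 passes through silently. Under these assumptions the message is informational: the out-of-range request is simply replaced by the nearest supported limit.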

Thanks