I am checking the title error log on a custom board using TX2/4GB.
Below are the questions.
What is “isr712” and what causes it?
Is “gk20a” the identifier of the log output source program? If you know a specific program, please let me know.
If 2 was “Yes”, there was also a log called “gp10b”. If you can identify the program for this as well, please let me know the program that generated the output.
Is not implemented something that the JetPack standard does not support but should be implemented by the user?
May I know how the error was presented?
[93177.355290] nvgpu: 17000000.gp10b gk20a_pmu_isr:724 [ERR] pmu exterr intr not implemented. Clearing interrupt.
[2023-06-16 19:54:39.731] Jpeg Size =17792!!
Any application running?
Yes.
Which JetPack release?
R32.4.4
An error from gk20a/gp10b indicates it is from nvgpu driver. For such issue, please clarify if this is causing any fatal result. If it does not cause system crash or application crash, then you could ignore it.
It does not mean you have to implement it. It is also pointless to share more detail to you about this driver.
It seems possible to change the version of the BSP part of JetPack.
Is it the part that has been repaired in the BSP version?
Also, since the Pinmux settings are different from the evaluation board, evaluation on the evaluation board is not possible.
What we want to know is what is triggering this phenomenon.
Excuse me, let me check just in case.
Let me confirm what “at least the BSP part is up to date” assumes.
a) Is it OK to update only the parts other than the source/rootfs, such as nVIDIA-provided binaries (including bootloader up to cboot), configuration settings, scripts such as flash.sh?
In this case, the Linux Kernel, rootfs, etc. are all left as they are and there is no need to rebuild them.
(In other words, the binary provided by nVIDIA, the flash environment, configuration settings, etc. may be the cause).
b) Is the Linux Kernel and glibc in the rootfs and other libraries and tools (such as CUDA) updated as well?
The apps developed here do not need to be changed.
(Are there any suspicious parts in Kernel, Kernel built-in debadora, glibc, libraries in rootfs, command daemons, etc.?)
Which image do you think it will be?
Answer a) or b).
By the way, with L4T 32.4.4 ⇒ 32.7.4, at least the kernel version has been updated from 4.9.140 ⇒ 4.9.337.
I mean the whole system should be upgraded. When system got upgraded, the SDK (CUDA/TRT) will be upgraded too.
It is not possible to tell whether this issue got fixed here or not. Maybe you would still hit issue here.
But this is kind of SOP.
For example, rel-32.1 supports TX2 too but it is a software released 4 years ago and we may already resolved lots of issue since then. It is not a good idea to fallback to old release to debug. Same to rel-32.4.4. 32.4.4 is already 3 years ago BSP.
If you cannot upgrade your system because it is custom board, then try to reproduce this issue on NV devkit + latest BSP.
To debug this issue, a upgrade is needed. But that is just for debug.
As I already said, If you cannot upgrade your system because it is custom board, then try to reproduce this issue on NV devkit + latest BSP. If it can reproduce on latest BSP, then we will try to fix this on latest BSP.
If it cannot reproduce on latest BSP, it probably means this issue had already be fixed.