I’m trying to capture kernel crashes on the Orin NX. I’ve followed the instructions here: Kernel Debugging Tools
When I test the configuration, the kexec kernel does not boot completely if the nvidia_drm module is loaded. I get SMMU faults and memory controller errors that prevent the initramfs boot from completing:
[ 15.305391] arm_smmu_global_fault: 1002413 callbacks suppressed
[ 15.305401] arm-smmu 10000000.iommu: Blocked unknown Stream ID 0x1; boot with "arm-smmu.disable_bypass=0" to allow, but this may have security implications
[ 15.305404] arm-smmu 10000000.iommu: GFSR 0x80000002, GFSYNR0 0x00000000, GFSYNR1 0x00000001, GFSYNR2 0x00000000
[ 15.305410] tegra30_mc_handle_irq: 641856 callbacks suppressed
[ 15.305412] tegra-mc 2c00000.memory-controller: nvdisplayr1: secure read @0x000000ffffffff00: EMEM address decode error (EMEM decode error)
[ 15.305434] arm-smmu 10000000.iommu: Blocked unknown Stream ID 0xc01; boot with "arm-smmu.disable_bypass=0" to allow, but this may have security implications
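To see which devices are still issuing DMA after the kexec, the faults can be tallied from a saved console log. This is only a rough sketch: the regular expressions are my own guesses based on the message formats in the excerpt above, and `summarize` is a hypothetical helper name, not part of any NVIDIA or kernel tooling.

```python
import re
from collections import Counter

# Patterns are assumptions derived from the log lines quoted in this post.
SID_RE = re.compile(r"Blocked unknown Stream ID (0x[0-9a-fA-F]+)")
MC_RE = re.compile(r"tegra-mc \S+: (\w+): ")


def summarize(log_text):
    """Count blocked SMMU stream IDs and memory-controller clients in a log dump."""
    sids = Counter(m.group(1).lower() for m in SID_RE.finditer(log_text))
    clients = Counter(m.group(1) for m in MC_RE.finditer(log_text))
    return sids, clients


# Three lines copied from the excerpt above.
sample = """\
[ 15.305401] arm-smmu 10000000.iommu: Blocked unknown Stream ID 0x1; boot with "arm-smmu.disable_bypass=0" to allow, but this may have security implications
[ 15.305412] tegra-mc 2c00000.memory-controller: nvdisplayr1: secure read @0x000000ffffffff00: EMEM address decode error (EMEM decode error)
[ 15.305434] arm-smmu 10000000.iommu: Blocked unknown Stream ID 0xc01; boot with "arm-smmu.disable_bypass=0" to allow, but this may have security implications
"""

sids, clients = summarize(sample)
print(dict(sids), dict(clients))
```

On the full log, the "callbacks suppressed" lines mean the real counts are far higher than what this tally shows, so it is useful for identifying the sources, not for measuring the fault rate.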
If I unload the nvidia_drm module first, the kexec kernel boots OK.
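For anyone reproducing the manual test, the unload step could be scripted roughly as below. This is a sketch under assumptions: it parses text in the `/proc/modules` format, `pre_kexec_commands` is a hypothetical helper, and the sample module lines are made up for illustration. It only mirrors the manual test; it would not help on a real panic, where no user-space code runs before the crash kernel starts.

```python
def loaded_modules(proc_modules_text):
    """Return the set of module names from /proc/modules-style text."""
    return {
        line.split()[0]
        for line in proc_modules_text.splitlines()
        if line.strip()
    }


def pre_kexec_commands(proc_modules_text, module="nvidia_drm"):
    """Build the command list for the manual test: unload the module if loaded."""
    cmds = []
    if module in loaded_modules(proc_modules_text):
        # `modprobe -r` may still fail if the module is in use.
        cmds.append(["modprobe", "-r", module])
    return cmds


# Illustrative sample only; sizes, counts and addresses are invented.
sample_modules = """\
nvidia_drm 94208 2 - Live 0x0000000000000000
nvidia_modeset 1261568 1 nvidia_drm, Live 0x0000000000000000
"""

print(pre_kexec_commands(sample_modules))
```

On the target, the text would come from reading `/proc/modules`, and each returned command could be run with `subprocess.run(cmd, check=True)` as root before triggering the kexec.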
The problem starts with the iommu configuration earlier in the boot log:
[ 0.295062] arm-smmu 10000000.iommu: SMMUv2 with:
[ 0.295064] arm-smmu 10000000.iommu: stage 1 translation
[ 0.295065] arm-smmu 10000000.iommu: stage 2 translation
[ 0.295066] arm-smmu 10000000.iommu: nested translation
[ 0.295068] arm-smmu 10000000.iommu: stream matching with 128 register groups
[ 0.295070] arm-smmu 10000000.iommu: 128 context banks (0 stage-2 only)
[ 0.295073] arm-smmu 10000000.iommu: Supported page sizes: 0x61311000
[ 0.295075] arm-smmu 10000000.iommu: Stage-1: 48-bit VA → 48-bit IPA
[ 0.295076] arm-smmu 10000000.iommu: Stage-2: 48-bit IPA → 48-bit PA
[ 0.295121] arm-smmu 10000000.iommu: Unexpected global fault, this could be serious
[ 0.295123] arm-smmu 10000000.iommu: GFSR 0x80000001, GFSYNR0 0x00000000, GFSYNR1 0x00000801, GFSYNR2 0x00000000
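The GFSR and GFSYNR1 values can be decoded to see what kind of fault is being reported and for which stream. The sketch below assumes the SMMUv2 global fault status bit layout as I understand it from the Arm specification, and assumes the stream ID sits in the low 16 bits of GFSYNR1 (which matches the "Stream ID 0x1" message paired with GFSYNR1 0x00000001 in the later log). `decode_fault` is a hypothetical helper; please check the bit names against the spec before relying on them.

```python
import re

# Assumed sGFSR bit layout (Arm SMMUv2); verify against the architecture spec.
GFSR_BITS = {
    0: "ICF (invalid context fault)",
    1: "USF (unidentified stream fault)",
    2: "SMCF (stream match conflict fault)",
    3: "UCBF (unimplemented context bank fault)",
    4: "UCIF (unimplemented context interrupt fault)",
    5: "CAF (configuration access fault)",
    6: "EF (external fault)",
    7: "PF (permission fault)",
    31: "MULTI (multiple faults)",
}

FAULT_RE = re.compile(
    r"GFSR (0x[0-9a-fA-F]+), GFSYNR0 (0x[0-9a-fA-F]+), GFSYNR1 (0x[0-9a-fA-F]+)"
)


def decode_fault(line):
    """Decode the GFSR flags and stream ID from one arm-smmu fault log line."""
    m = FAULT_RE.search(line)
    if m is None:
        return None
    gfsr, _gfsynr0, gfsynr1 = (int(v, 16) for v in m.groups())
    flags = [name for bit, name in sorted(GFSR_BITS.items()) if gfsr & (1 << bit)]
    # Assumption: the faulting stream ID is in GFSYNR1[15:0].
    return {"flags": flags, "stream_id": gfsynr1 & 0xFFFF}


early = "arm-smmu 10000000.iommu: GFSR 0x80000001, GFSYNR0 0x00000000, GFSYNR1 0x00000801, GFSYNR2 0x00000000"
late = "arm-smmu 10000000.iommu: GFSR 0x80000002, GFSYNR0 0x00000000, GFSYNR1 0x00000001, GFSYNR2 0x00000000"

for line in (early, late):
    print(decode_fault(line))
```

Under those assumptions, the early fault (GFSR 0x80000001) decodes as an invalid context fault for stream ID 0x801, while the later one (GFSR 0x80000002) is an unidentified stream fault for stream ID 0x1, both with the multiple-fault bit set.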
What do I need to do to solve this? Reconfigure the memory map for the crash kernel? Use a kernel parameter to reset the display hw? Something else?
Configuration is JetPack 6.1/L4T R36.4.3