TensorFlow crash - nvgpu error, BUG: non-zero nr_ptes on freeing mm: 1

Target Operating System

Hardware Platform
NVIDIA DRIVE™ AGX Xavier DevKit (E3550)

Host Machine Version
native Ubuntu 18.04

The Jetson AGX Xavier node hangs. When I checked dmesg, the error appeared to be a bug related to memory handling. A Google search on the issue points to a kernel bug in the 4.x kernels; however, I am not sure whether the same error still exists in the official L4T kernel as well.
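To make the dmesg check repeatable, here is a minimal sketch that scans saved dmesg text for the message quoted in the title. It assumes the log has already been captured as a string (for example from `dmesg > dmesg.txt`); the function name `find_nr_ptes_bugs` is hypothetical and is not part of any NVIDIA or kernel tool.

```python
import re

# Matches the kernel message quoted in the thread title and captures the count.
NR_PTES_BUG = re.compile(r"BUG: non-zero nr_ptes on freeing mm: (-?\d+)")


def find_nr_ptes_bugs(log_text):
    """Return a list of (line_number, count) for each matching dmesg line.

    Hypothetical helper: log_text is the full dmesg output as one string.
    Line numbers are 1-based positions within that string.
    """
    hits = []
    for lineno, line in enumerate(log_text.splitlines(), start=1):
        match = NR_PTES_BUG.search(line)
        if match:
            hits.append((lineno, int(match.group(1))))
    return hits
```

Calling `find_nr_ptes_bugs(open("dmesg.txt").read())` on a saved log returns an empty list when the message is absent, which makes it easy to compare runs before and after a kernel or driver change.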

Hi pjanakar,

May I know whether you're using the Jetson AGX Xavier platform or the DRIVE AGX platform?
Which JetPack or DRIVE SW version are you using?