[DGX Spark] XHCI Controller "HC Died" crashes with RealSense D435i streaming 30fps (depth, RGB)

Hi everyone,

I’m running into an stability issue with the USB subsystem on my DGX Spark and wanted to see if anyone else has encountered this or if it’s a known limitation of the current Hardware or anything from FW/Driver. To rule out cable issues I am able to stream, toggling on/off many time without issue on another Ubunut linux laptop.

The Setup:

  • Hardware: NVIDIA DGX Spark

  • Device: Intel RealSense D435i

  • Connection: Tested both Directly to the port and via a Powered StarTech Industrial Hub (Wall-powered).

  • Workload:

    • Standard RGB stream (640x480 @ 30fps) using the Native V4L2 driver (I did try RUSB driver also and it failed the same way) whenever turning on both RGB + depth
    • Also crashed if toggling on/off with only RGB only from realsense-viewer
[  429.539999] xhci-hcd NVDA8000:02: xHCI host not responding to stop endpoint command
[  429.602116] xhci-hcd NVDA8000:02: Host halt failed, -110
[  429.602119] xhci-hcd NVDA8000:02: xHCI host controller not responding, assume dead
[  429.603098] xhci-hcd NVDA8000:02: HC died; cleaning up

bug_report_dgx_spark.tar.gz (37.5 KB)

Thank you for the logs and the bug report. We will look into this and get back to you

I am having a similar issue with one of my DGX sparks. I can SSH and SFTP to the box. RDC will not connect. Nor can I get the Gnome Desktop to boot There are text errors that are displaying through the KVM display. My other DGX works fine through both RDC and the KVM.

[ xhci_endpoint_init rsv 0x801 ]

(9 entries of this error)

usb 1:1:device descriptor read/64, error -71

(2 entries of this error)

How do I recover from this?