When I attempted to load the “Hello World” example from Isaac Examples, the terminal shut down due to an error. The same issue occurs when trying to load other examples from Isaac.
I’ve already tried reinstalling the NVIDIA driver and Omniverse, including using the launcher-cleanup function, but the problem persists.
Hello,
I am getting the same VkResult: ERROR_DEVICE_LOST error, which causes Isaac to crash or hang on version 2022.2.1 with the 525.60.11 driver. It occurs shortly after Isaac starts, so I was unable to run even the examples.
Uninstalling and reinstalling Isaac got me as far as the ray tracing compilation, but then the error occurred again.
Below are my system specs.
$ nvidia-smi
Mon Aug 14 18:47:58 2023
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 525.60.11 Driver Version: 525.60.11 CUDA Version: 12.0 |
|-------------------------------+----------------------+----------------------+
| GPU Name Persistence-M| Bus-Id Disp.A | Volatile Uncorr. ECC |
| Fan Temp Perf Pwr:Usage/Cap| Memory-Usage | GPU-Util Compute M. |
| | | MIG M. |
|===============================+======================+======================|
| 0 NVIDIA GeForce ... On | 00000000:04:00.0 Off | N/A |
| 30% 55C P8 16W / 250W | 10MiB / 24576MiB | 0% Default |
| | | N/A |
+-------------------------------+----------------------+----------------------+
| 1 NVIDIA GeForce ... On | 00000000:09:00.0 On | N/A |
| 90% 73C P8 56W / 250W | 582MiB / 24576MiB | 1% Default |
| | | N/A |
+-------------------------------+----------------------+----------------------+
+-----------------------------------------------------------------------------+
| Processes: |
| GPU GI CI PID Type Process name GPU Memory |
| ID ID Usage |
|=============================================================================|
| 0 N/A N/A 1918 G /usr/lib/xorg/Xorg 4MiB |
| 0 N/A N/A 2905 G /usr/lib/xorg/Xorg 4MiB |
| 1 N/A N/A 1918 G /usr/lib/xorg/Xorg 53MiB |
| 1 N/A N/A 2905 G /usr/lib/xorg/Xorg 141MiB |
| 1 N/A N/A 3065 G /usr/bin/gnome-shell 85MiB |
| 1 N/A N/A 4817 G /usr/lib/firefox/firefox 163MiB |
+-----------------------------------------------------------------------------+
$ lshw -short
WARNING: you should run this program as super-user.
H/W path Device Class Description
============================================================
system Computer
/0 bus Motherboard
/0/0 memory 130GiB System memory
/0/1 processor AMD Ryzen 9 5900X 12-Core Processor
/0/100 bridge Starship/Matisse Root Complex
/0/100/0.2 generic Starship/Matisse IOMMU
/0/100/1.1 bridge Starship/Matisse GPP Bridge
/0/100/1.1/0 storage E16 PCIe4 NVMe Controller
/0/100/1.1/0/0 /dev/nvme0 storage Force MP600
/0/100/1.1/0/0/1 /dev/nvme0n1 disk NVMe namespace
/0/100/1.2 bridge Starship/Matisse GPP Bridge
/0/100/1.2/0 bridge Matisse Switch Upstream
/0/100/1.2/0/2 bridge Matisse PCIe GPP Bridge
/0/100/1.2/0/2/0 display NVIDIA Corporation
/0/100/1.2/0/2/0.1 multimedia NVIDIA Corporation
/0/100/1.2/0/4 bridge Matisse PCIe GPP Bridge
/0/100/1.2/0/4/0 enp5s0 network I211 Gigabit Network Connection
/0/100/1.2/0/8 bridge Matisse PCIe GPP Bridge
/0/100/1.2/0/8/0 generic Starship/Matisse Reserved SPP
/0/100/1.2/0/8/0.1 bus Matisse USB 3.0 Host Controller
/0/100/1.2/0/8/0.3 bus Matisse USB 3.0 Host Controller
/0/100/1.2/0/9 bridge Matisse PCIe GPP Bridge
/0/100/1.2/0/9/0 storage FCH SATA Controller [AHCI mode]
/0/100/1.2/0/a bridge Matisse PCIe GPP Bridge
/0/100/1.2/0/a/0 storage FCH SATA Controller [AHCI mode]
/0/100/3.1 bridge Starship/Matisse GPP Bridge
/0/100/3.1/0 display NVIDIA Corporation
/0/100/3.1/0.1 multimedia NVIDIA Corporation
...
Update: On the same system under Windows, the application also crashes. The log file is attached: kit_20230815_105609.log (1.1 MB)
Update 2: With one GPU unpowered and a fresh installation of the driver (525.60.11), the application still crashes after opening the Hello World example. Log: kit_20230815_142152.log (1.4 MB)
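For anyone comparing their own Kit logs against the ones attached above, here is a minimal sketch that pulls out the lines mentioning the device-lost error. It assumes the log is plain text and that the error string appears literally as ERROR_DEVICE_LOST, as in the message quoted earlier; the function name find_device_lost is made up for illustration and is not part of Isaac or Omniverse.

```python
import sys
from pathlib import Path


def find_device_lost(log_path, needle="ERROR_DEVICE_LOST"):
    """Return (line_number, text) pairs for log lines containing `needle`.

    Hypothetical helper: assumes a plain-text log in which the error
    string appears literally.
    """
    hits = []
    # errors="replace" keeps the scan going if the log has odd bytes in it.
    with Path(log_path).open("r", encoding="utf-8", errors="replace") as fh:
        for lineno, line in enumerate(fh, start=1):
            if needle in line:
                hits.append((lineno, line.rstrip("\n")))
    return hits


if __name__ == "__main__" and len(sys.argv) > 1:
    # Usage: python find_device_lost.py <path to a kit_*.log file>
    for lineno, text in find_device_lost(sys.argv[1]):
        print(f"{lineno}: {text}")
```

The line numbers make it easier to look at what was logged just before the first occurrence, which is usually more informative than the error line itself.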
A BIOS update seems to have resolved this issue. I can now run the Isaac navigation example on both Ubuntu 20.04 and Ubuntu 22.04. FYI, the motherboard model is GIGABYTE X570 AORUS ELITE.
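Since the fix here was a BIOS update, it may help others to include their board and BIOS version when reporting the same error. A rough sketch for reading them on Linux without root, assuming the standard sysfs DMI directory /sys/class/dmi/id is present (it may be missing in containers or on some VMs); the helper name read_dmi_info is made up for illustration.

```python
from pathlib import Path

# Files normally exposed by the Linux kernel under /sys/class/dmi/id.
DMI_FIELDS = ("board_vendor", "board_name", "bios_vendor", "bios_version", "bios_date")


def read_dmi_info(dmi_dir="/sys/class/dmi/id"):
    """Return a dict of board/BIOS fields; a value is None if unreadable.

    Hypothetical helper: `dmi_dir` is a parameter so the function can be
    pointed at another directory for testing.
    """
    info = {}
    for field in DMI_FIELDS:
        try:
            info[field] = (Path(dmi_dir) / field).read_text().strip()
        except OSError:
            # File missing or not readable on this system.
            info[field] = None
    return info


if __name__ == "__main__":
    for key, value in read_dmi_info().items():
        print(f"{key}: {value if value is not None else 'unavailable'}")
```

Posting this output next to the nvidia-smi output makes it easier to tell whether two reports involve the same board and firmware.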