Hello,
I am trying to figure out why CUDA is not working on an AGX Orin Dev Kit. I've tried several of the CUDA samples from the Samples directory, and every CUDA-enabled program I run reports that no CUDA devices were found. I wonder whether something wasn't configured or installed correctly after my previous debacle[0]?
Example:
cuda-samples/Samples/1_Utilities/deviceQuery$ ./deviceQuery
./deviceQuery Starting...
CUDA Device Query (Runtime API) version (CUDART static linking)
cudaGetDeviceCount returned 100
-> no CUDA-capable device is detected
Result = FAIL
dmesg -T | grep NVRM
[Sun Jun 19 14:50:46 2022] NVRM: loading NVIDIA UNIX Kernel Module for aarch64 34.1.1 Release Build (buildbrain@mobile-u64-5414-d7000) Mon May 16 21:12:24 PDT 2022
[Sun Jun 19 14:51:00 2022] NVRM gpumgrGetSomeGpu: Failed to retrieve pGpu - Too early call!.
[Sun Jun 19 14:51:00 2022] NVRM nvAssertFailedNoLog: Assertion failed: NV_FALSE @ gpu_mgr.c:295
[Sun Jun 19 14:51:00 2022] nvRmApiAlloc+0x30/0x40 [nvidia_modeset]
[Sun Jun 19 14:51:00 2022] NVRM gpumgrGetSomeGpu: Failed to retrieve pGpu - Too early call!.
[Sun Jun 19 14:51:00 2022] NVRM nvAssertFailedNoLog: Assertion failed: NV_FALSE @ gpu_mgr.c:295
[Sun Jun 19 14:51:01 2022] NVRM rpcRmApiControl_dce: NVRM_RPC_DCE: Failed RM ctrl call result 0x56:
[Sun Jun 19 14:51:01 2022] NVRM rpcRmApiControl_dce: NVRM_RPC_DCE: Failed RM ctrl call result 0x56:
[Sun Jun 19 14:51:01 2022] NVRM rmapiAllocWithSecInfo: allocation failed; status: Given class-id not valid [NV_ERR_INVALID_CLASS] (0x00000022)
[Sun Jun 19 14:51:01 2022] NVRM rmapiAllocWithSecInfo: client:0xc1d00001 parent:0xcaf00001 object:0x0 class:0x402c
[Sun Jun 19 14:51:01 2022] NVRM rpcRmApiControl_dce: NVRM_RPC_DCE: Failed RM ctrl call result 0x56:
[Sun Jun 19 14:51:01 2022] NVRM nvAssertFailed: Assertion failed: 0 @ g_mem_mgr_nvoc.h:1190
[Sun Jun 19 14:51:01 2022] nvRmApiAlloc+0x30/0x40 [nvidia_modeset]
[Sun Jun 19 14:51:01 2022] NVRM nvAssertFailed: Assertion failed: 0 @ g_mem_mgr_nvoc.h:1190
[Sun Jun 19 14:51:01 2022] NVRM nvAssertFailed: Assertion failed: 0 @ g_mem_mgr_nvoc.h:1190
[Sun Jun 19 14:51:01 2022] NVRM nvAssertFailed: Assertion failed: 0 @ g_mem_mgr_nvoc.h:1190
[Sun Jun 19 14:51:01 2022] NVRM nvAssertFailed: Assertion failed: 0 @ g_mem_mgr_nvoc.h:1190
[Sun Jun 19 14:51:01 2022] NVRM nvAssertFailed: Assertion failed: 0 @ g_mem_mgr_nvoc.h:1190
[Sun Jun 19 14:51:01 2022] NVRM nvAssertFailed: Assertion failed: 0 @ g_mem_mgr_nvoc.h:1190
[Sun Jun 19 14:51:01 2022] NVRM rpcRmApiControl_dce: NVRM_RPC_DCE: Failed RM ctrl call result 0xffff:
[Sun Jun 19 14:51:01 2022] NVRM rpcRmApiControl_dce: NVRM_RPC_DCE: Failed RM ctrl call result 0x56:
[Sun Jun 19 14:51:01 2022] NVRM nvAssertFailed: Assertion failed: 0 @ g_mem_mgr_nvoc.h:1190
[Sun Jun 19 14:51:01 2022] NVRM nvAssertFailed: Assertion failed: 0 @ g_mem_mgr_nvoc.h:1175
[Sun Jun 19 14:51:01 2022] NVRM nvAssertFailed: Assertion failed: 0 @ g_mem_mgr_nvoc.h:1190
[Sun Jun 19 14:51:01 2022] NVRM rpcRmApiControl_dce: NVRM_RPC_DCE: Failed RM ctrl call result 0xffff:
[Sun Jun 19 14:51:02 2022] NVRM nvAssertFailed: Assertion failed: 0 @ g_mem_mgr_nvoc.h:1175
[Sun Jun 19 14:51:05 2022] NVRM nvAssertFailed: Assertion failed: 0 @ g_mem_mgr_nvoc.h:1175
[Sun Jun 19 14:51:05 2022] NVRM nvAssertFailed: Assertion failed: 0 @ g_mem_mgr_nvoc.h:1175
[Sun Jun 19 14:52:14 2022] NVRM: failed to register with the ACPI subsystem!
[Sun Jun 19 14:52:14 2022] NVRM: failed to unregister from the ACPI subsystem!
which nvcc
/usr/local/cuda/bin/nvcc
nvcc --version
nvcc: NVIDIA (R) Cuda compiler driver
Copyright (c) 2005-2021 NVIDIA Corporation
Built on Thu_Nov_11_23:44:05_PST_2021
Cuda compilation tools, release 11.4, V11.4.166
Build cuda_11.4.r11.4/compiler.30645359_0
I have a working GUI and everything else (as far as I can tell) is okay. Does anyone have a suggestion? Thank you!