Cuda Runtime API error for cuda Graph and OpenCV

fd1 · May 24, 2022, 9:18am

Hello there,

I created a Cuda Graph, using the runtime api and recording the operations; for now it’s a just a simple loop of resize operations from opencv;

I then try to inspect the graph using a simple operation as calling:

auto getKernelParams(cudaGraphNode_t &n) -> cudaKernelNodeParams
{
    cudaKernelNodeParams par;
    cudaGraphKernelNodeGetParams(n, &par);
};

but the call to cudaGraphKernelNodeGetParams return error “cudaErrorInvalidDeviceFunction”;

What are the reasons of this kind of error?
Cuda is installed using the sdk manager. Ubuntu 18, Jetpack 4.6 and Cuda 10.2;

AastaLLL · May 25, 2022, 3:07am

Hi,

You can find the detailed error information in our document below:

https://docs.nvidia.com/cuda/cuda-runtime-api/group__CUDART__TYPES.html
cudaErrorInvalidDeviceFunction = 98
The requested device function does not exist or is not compiled for the proper device architecture.

The error indicates your CUDA app has not complied with NX architecture.
Please make sure you have added this to the nvcc configuration.

For example:

$ nvcc -gencode arch=compute_72,code=sm_72 test.cu

Thanks.

fd1 · May 25, 2022, 8:46am

Thanks @AastaLLL for the answer;

The problem is that I don’t compile anything with nvcc; moreover OpenCV is installed using your JEP script (correct architteture). The error is just coming from the call to the API.

I’m using CMakeLists to find Cuda and the linkage is correct, also according to the documentation, the set-get function for kernel parameters are the only 2 function that actually can return that specific error.

So I’m curious about other reason for the error apart from the wrong device architecture.

AastaLLL · May 27, 2022, 3:31am

Hi,

You can set GPU architecture in CMakeLists as well.
Here is an example for your reference:

github.com

dusty-nv/jetson-inference/blob/master/CMakeLists.txt#L74


      
          
          
# setup CUDA
          set(CMAKE_MODULE_PATH ${CMAKE_MODULE_PATH} "${CMAKE_CURRENT_SOURCE_DIR}/utils/cuda" )
          find_package(CUDA)
          message("-- CUDA version: ${CUDA_VERSION}")
          
          
set(CUDA_NVCC_FLAGS ${CUDA_NVCC_FLAGS}; -O3)
          
          
if(CMAKE_SYSTEM_PROCESSOR MATCHES "aarch64")
          	message("-- CUDA ${CUDA_VERSION} detected (${CMAKE_SYSTEM_PROCESSOR}), enabling SM_53 SM_62")
          	set(CUDA_NVCC_FLAGS ${CUDA_NVCC_FLAGS}; -gencode arch=compute_53,code=sm_53 -gencode arch=compute_62,code=sm_62)
          
          
	if(CUDA_VERSION_MAJOR GREATER 9)
          		message("-- CUDA ${CUDA_VERSION} detected (${CMAKE_SYSTEM_PROCESSOR}), enabling SM_72")
          		set(CUDA_NVCC_FLAGS ${CUDA_NVCC_FLAGS}; -gencode arch=compute_72,code=sm_72)
          	endif()
          
          
	if(CUDA_VERSION_MAJOR GREATER 10)
          		message("-- CUDA ${CUDA_VERSION} detected (${CMAKE_SYSTEM_PROCESSOR}), enabling SM_87")
          		set(CUDA_NVCC_FLAGS ${CUDA_NVCC_FLAGS}; -gencode arch=compute_87,code=sm_87)
          	endif()

Please noted that the GPU architecture of Xavier NX is sm_72.
Thanks.

fd1 · May 27, 2022, 12:52pm

Hi @AastaLLL,
I implemented your suggestion but it didn’t change anything; I made a small sample to reproduce the “error”, it might be helpful to understand better what is going on; I tried to stay close to the structure of the main project but there are some differences in the code since we are using cpp20 and GCC11.x;

The sample compiles with GCC7.5; and the build commands are in the bash script.
sample.zip (6.3 KB)

Thanks.

AastaLLL · May 30, 2022, 6:35am

Thanks for sharing the source with us.

We are going to check it internally.
Will get back to you later.

AastaLLL · June 16, 2022, 4:38am

Hi,

Thanks for your patience.

We try to reproduce this issue on a JetPack 4.6.1 environment.
But the compiling fails due to a missing OpenCV header as below:

In file included from /home/nvidia/topic_215408/sample/SampleLib/processor.hpp:4:0,
                 from /home/nvidia/topic_215408/sample/exec/sample.cpp:1:
/home/nvidia/topic_215408/sample/SampleLib/image_processing.hpp:4:10: fatal error: opencv2/cudawarping.hpp: No such file or directory
 #include <opencv2/cudawarping.hpp>
          ^~~~~~~~~~~~~~~~~~~~~~~~~
compilation terminated.

Is this reproducible with the default OpenCV package?
If not, could you share which OpenCV version you use?

Thanks.

fd1 · June 17, 2022, 6:32am

Hi,

The OpenCV package is built from sorce using your(I guess) script JEP. Version is 4.5.4 and it is built with cuda support.

IIRC there might be some differences like not building for python and/or using ccache; I’ll attach the generated makefile for a comparison if needed.
makefile_text.txt (332.0 KB)

AastaLLL · June 28, 2022, 6:23am

Hi,

Here are some updates for you:

We can reproduce this issue internally.
It looks like there are issues related to the CUDA Graph but we need more time to investigate.
Will give you an update once we got further information.

Thanks.

fd1 · June 28, 2022, 8:37am

Thanks @AastaLLL

I’ll wait for the update.

AastaLLL · July 13, 2022, 8:38am

Hi,

Thanks for your patience.
This issue is related to CUDA runtime API.

If you create the CUDA graph from a dynamic library and tries to introspect it outside the library.
The query functions may fail because the nodes reference CUDA C++ symbols that belong to a different runtime and are not in the local runtime’s map.

The workaround currently is to use driver APIs instead.
We can run your sample without issue after switching to the driver APIs.

Here is our change for your reference: driverAPIs.patch (3.4 KB)

Thanks.

fd1 · July 13, 2022, 10:13am

Thanks to you.

system · August 3, 2022, 3:04am

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
OpenCV Cuda: No Kernel Image is Available Jetson Xavier NX opencv , cuda	8	5489	October 18, 2021
NVCC fatal error, make: *** [cudaobj/Debug/fkt_alles_cuda.o] Error 1 - Solved CUDA Setup and Installation	4	1594	February 4, 2019
Runtime error on Ubuntu 8.04 with latest cuda release CUDA Programming and Performance	7	14919	January 4, 2009
cudaGraphicsMapResources() and cuCtxCreate() incompatible? CUDA Programming and Performance	9	1848	April 7, 2018
Error trying to use OpenCV with CUDA support on Docker: CUDA driver version is insufficient for CUDA runtime version Jetson Nano opencv , cuda , docker	4	2823	June 27, 2022
Correct compute architecture for TX2 and OpenCV4Tegra compute architecture Jetson TX2 opencv	13	5598	October 18, 2021
All CUDA-capable devices busy or unavailable Jetson TX2 cuda	9	3969	December 28, 2021
Error when compiling Cuda samples Jetson Orin NX cuda	5	183	August 26, 2024
Running Cuda application built and running on TX2 on Xavier Jetson AGX Xavier	7	1229	October 18, 2021
undefined reference to `cudaSetupArgument', `cudaLaunch' CUDA Programming and Performance	9	5959	November 12, 2019

Cuda Runtime API error for cuda Graph and OpenCV

Related topics