Using CMake with Matlab GPU Coder generated CUDA code

kubaixixel · March 2, 2021, 10:40am

Hey, I created simple function to return cos of an array in matlab. I generated this function in GPU Coder an got cuda code. I have also succesfully generated and executed executable with that function generated in GPU coder on Jetson nano. But when I try to generate same functions with same main file with cmake, executable is succesfully generated, but it returns array of zeros. However both GPU coder and CMake executables work, when the matlab functions input is scalar, instead of and array (1x100 in my case).
This is matlab function, I want CUDA from.

This is the generated CUDA function:

This is how I call it:

When I generate this on jetson with cmake with cuda 10.2. installed, output array is 0.
When I generate this on Windows PC with cmake, output array is correct.
I managed to run it on jetson in debug mode.

Variables in first breakpoint:

In breakpoint on the line 45 variables are:

Then after calling cudaMalloc variables are

dusty_nv · March 3, 2021, 1:41am

Hi @kubaixixel, I’m not familiar with using MATLAB GPU Coder, so maybe someone from the community can share their experiences. Or you may also want to contact MATLAB support.

One thing you can try, is adding a printf inside your CUDA kernel to log the values and see if they are being generated.

Another thing you can do, is add error checking to the CUDA API calls such as cudaMalloc() and cudaMemcpy(). You can also call cudaGetLastError() after your kernel. Here are some example error checking macros:

https://github.com/dusty-nv/jetson-utils/blob/1f3709f48258c2d75500c35605e8f6f4a3447afc/cuda/cudaUtility.h

kubaixixel · March 3, 2021, 12:21pm

Thanks, I will have a look at that. Also forgot to add that when I generate the executable directly in Matlab GPU coder, vector input works, so I have to think that it has something to do with building the executable. This is my CMakeLists.txt:

dusty_nv · March 4, 2021, 2:55pm

@kubaixixel if you are on Jetson Nano, you should be using -gencode arch=compute_53,code=sm_53 in your CMakeLists.txt. My guess is the CUDA kernel was failing to launch because it was compiled for the wrong GPU arch, but you didn’t see this failure because there was no error checking.

For reference, you can enable list all of these if you want to compile it for all Jetson devices:

-gencode arch=compute_53,code=sm_53
-gencode arch=compute_62,code=sm_62
-gencode arch=compute_72,code=sm_72

kubaixixel · March 4, 2021, 3:33pm

@dusty_nv thank you, that solves it.

Topic		Replies	Views
Using CMake with Matlab GPU Coder generated CUDA code Jetson Nano matlab	6	820	October 15, 2021
Attempting to make CUDA files in the latest release of the Accelerated Computing Teaching Kit on a Nano Jetson Nano cuda	6	543	November 17, 2021
Compile on Jetson TX2 a OpenCV example WITH_CUDA=ON Jetson TX2	5	5899	October 18, 2021
OpenCV 3.1.0 and CUDA make Error On TK1 Jetson TK1	5	2224	October 18, 2021
Error running Cuda Code on Jetson TX1 Jetson TX1	4	694	October 18, 2021
CMake & nvcc 11.3.109 CUDA Developer Tools	0	735	June 21, 2021
How to do CUDA programming on Jetson Nano? Jetson Nano	4	7315	October 18, 2021
GPU Coder Matlab to Jetson Nano - Build error : C++ compiler Jetson Nano cuda , machine-learning	3	1111	May 20, 2022
Building Opencv with CUDA Jetson Orin Nano opencv , cuda	8	1900	March 15, 2024
Jetson Nano CPP Support Jetson Nano	12	1381	April 10, 2023

Using CMake with Matlab GPU Coder generated CUDA code

Related topics