dynamic parallelism with cmake

soni · March 18, 2016, 7:27am

hello…
I have been trying to compile a .cu file which consist of a parent kernel and a child kernel using cmake;

my .cu file:
global void childKernel()
{
printf("Hello ");
}

global void parentKernel()
{

childKernel<<<1,5>>>();
cudaDeviceSynchronize();

printf(“World!\n”);
}

my cmake file:

find_package(CUDA QUIET REQUIRED)

include_directories(/usr/include)
include_directories(/usr/local/cuda/lib)

    set(CUDA_SEPARABLE_COMPILATION ON)
set(CUDA_PROPAGATE_HOST_FLAGS OFF)

set(CUDA_NVCC_FLAGS "-arch=compute_53;-code=sm_53; -rdc=true -O3" )

set(PROJECT_LINK_LIBS  -L/usr/local/cuda/targets/armv7-linux-gnueabihf/lib -lcudadevrt -L/usr/local/cuda/targets/armv7-linux-gnueabihf/lib -lcublas -L/usr/local/cuda/targets/armv7-linux-gnueabihf/lib -lcublas_device)

    target_link_libraries(DRA ${PROJECT_LINK_LIBS}  ${CUDA_LIBRARIES})

when i compile the files using the command make I don’t get any error but when i run the .out file the kernels are not launched.

I am not able to find where the error/bug is

soni · March 18, 2016, 8:08am

When i compile the same file directly on the terminal using:

nvcc -arch=compute_53 -code=sm_53 -rdc=true helloworld.cu -o hello -lcudadevrt

it works fine

KapilMehta · March 21, 2016, 10:48am

hello,

I am also facing the same issue…

i am setting flags in cmake as

set(CUDA_NVCC_FLAGS ${CUDA_NVCC_FLAGS} “-arch=compute_53 -code=sm_53 -rdc=true -O3” )

and i set set(CUDA_VERBOSE_BUILD ON)

i think -arch=compute_53 -code=sm_53 -rdc=true -O3 flags are being passed to /usr/bin/cc instead of nvcc and that’s why kernel is not getting launched because of absence of dynamic parallelism flags.

Any insight is highly appreciated…

tera · March 21, 2016, 9:23pm

I’m not particularly using cmake other than when existing code forces it on me, but what output do you get when running

make VERBOSE=1

? That should be the first step in diagnosing the problem.

Topic		Replies	Views
How to compile the Dynamic Parallelism CUDA code by cmake ? CUDA Programming and Performance	0	1276	February 15, 2017
Compile cuda program with Dynamic Parallelism Jetson TX2	4	3816	October 18, 2021
Dynamic Parallelism on TX1 Jetson TX1	3	2563	April 28, 2016
Cuda Dynamic Parallelism trigger thrust error CUDA Programming and Performance cuda	4	1032	October 21, 2022
Cmake dynamic parallel compilation, works on V100 but errors on RTX2060 CUDA NVCC Compiler	0	650	January 7, 2022
Problems with dynamic parallelism in Ubuntu 14.04 and CUDA 6.5 CUDA Programming and Performance	2	1185	September 24, 2014
calling a __global__ function() from a __global__ function CUDA Programming and Performance	9	9977	August 3, 2019
Cannot use dynamic parallelism CUDA Setup and Installation	5	2131	June 22, 2016
CMake CUDA dynamic parallelism LINK2001 error CUDA Programming and Performance	0	773	January 10, 2018
Trouble building with Dynamic Parallelism on Nsight Eclipse CUDA Setup and Installation	1	1727	May 6, 2013

dynamic parallelism with cmake

Related topics