I have written a simple program to test dynamic parallelism. The code is given below:
#include <stdio.h>
__global__ void d_hello()
int a = 5;
__global__ void hello()
I compiled the code with the flags -arch=compute_37 and -rdc=true. Compilation completes without errors and the executable is generated, but when I run the executable I see no output.
I am using a Tesla K80 on an AWS instance, and my CUDA version is 7.5.
Thanks for the help in advance! :)