When i set the brake point inside the kernel, i can enter it, but I can’t see what’s the result of the execution. The debugger shows me how he goes through the code, but I can’t see any of the results of the execution. I mean if I write a=5; I usually see in the lower left corner that the variable a got the value 5, but here I don’t see anything. Can anybody help me with this problem?
Same problem - can’t enter a global , i.e. kernel, function. Even if I put a break in it - once I press F5 the breakpoint goes inactive and VS says this line won’t be hit 'cause there’s no code associated for this line… even if this global is in the same file as it’s host launcher function (one containing the foo<<<>>> (…) call) - it won’t work. :(
Yes, of course I set -deviceemu, and -debug info generation, disabled optimization in both VC++ and NVCC compilers…
I have a stong feeling that this is because of a wrong cuda build rule. I mean - the CUDA property branch in project settings differs greatly depending on which custom build rule is used, and at first (with Wizard’s build rule) - it didn’t break inside any .cu file at all. Now (with CUDA build rule v3.0.14 from SDK_v3.1_x32) it does debug host functions in .cu, but not kernels…
So - wich cuda build rule is correct? And what type of output should be generated for cuda files (.gpu, .cudabin, .cuda, .ptx, .cu.c or which?)
Same problem - can’t enter a global , i.e. kernel, function. Even if I put a break in it - once I press F5 the breakpoint goes inactive and VS says this line won’t be hit 'cause there’s no code associated for this line… even if this global is in the same file as it’s host launcher function (one containing the foo<<<>>> (…) call) - it won’t work. :(
Yes, of course I set -deviceemu, and -debug info generation, disabled optimization in both VC++ and NVCC compilers…
I have a stong feeling that this is because of a wrong cuda build rule. I mean - the CUDA property branch in project settings differs greatly depending on which custom build rule is used, and at first (with Wizard’s build rule) - it didn’t break inside any .cu file at all. Now (with CUDA build rule v3.0.14 from SDK_v3.1_x32) it does debug host functions in .cu, but not kernels…
So - wich cuda build rule is correct? And what type of output should be generated for cuda files (.gpu, .cudabin, .cuda, .ptx, .cu.c or which?)