Nsight skips (ignores) over break points in VS10 Cuda works fine, nsight consistently skips over sev

TripleS · May 30, 2012, 7:56pm

Hey

I’m using Nsight 2.2, CUDA Toolkit 4.2, and the latest NVIDIA driver, with a couple of GPUs in my computer. The project uses the CUDA 4.2 build customization.

I have set “generate GPU output” in the CUDA project properties, and the Nsight Monitor is running (everything looks fine).

I set several breakpoints in my __global__ kernel function. Nsight stops at the declaration of the function but skips over several of the breakpoints. It is as if Nsight decides on its own whether to hit a breakpoint or skip it.

The funny thing is that Nsight stops at for loops but doesn’t stop at simple assignment statements.

Another problem is that I can’t set focus on variables or add them to the watch list. In this case (see attached screenshot) I can’t resolve the values of the variables “posss” and “testDetctoinRate1”, which are registers here. On the other hand, shared memory or block memory variables are added to the Locals list automatically.

Here is a screen shot of the kernel , before debugging:

Here is a screen shot during debugging

I invoke my kernel function with the following call:

checkCUDA<<<1, 32>>>(sumMat->rows, sumMat->cols, (UINT *)pGPUsumMat);

cudaError = cudaGetLastError();
if (cudaError != cudaSuccess)
{
	printf("CUDA error: %s\n", cudaGetErrorString(cudaError));
	exit(-1);
}

The kernel call completes without an error.
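A note on the check above: kernel launches are asynchronous, so cudaGetLastError() called right after the launch reports launch failures only. Errors raised while the kernel is executing surface at a later synchronizing call. The following is a minimal sketch of checking both, not the poster's actual code: touchKernel and launchAndCheck are hypothetical names, and the kernel body is a stand-in because the real checkCUDA body is only visible in the screenshots.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical stand-in kernel: increments each of the first n elements.
__global__ void touchKernel(unsigned int *data, int n)
{
    int i = threadIdx.x;
    if (i < n)
        data[i] += 1u;
}

// Launches the kernel on device memory dData and reports both kinds of
// failure. Returns cudaSuccess only if launch and execution both succeeded.
cudaError_t launchAndCheck(unsigned int *dData, int n)
{
    touchKernel<<<1, 32>>>(dData, n);

    // 1) Launch-time errors (for example, an invalid launch configuration).
    cudaError_t err = cudaGetLastError();
    if (err != cudaSuccess) {
        printf("CUDA launch error: %s\n", cudaGetErrorString(err));
        return err;
    }

    // 2) Errors raised while the kernel runs (for example, an out-of-bounds
    //    access) are only reported once the host waits for the kernel.
    err = cudaDeviceSynchronize();
    if (err != cudaSuccess)
        printf("CUDA execution error: %s\n", cudaGetErrorString(err));
    return err;
}
```

Calling launchAndCheck(dData, 32) on a buffer allocated with cudaMalloc would return cudaSuccess only when both stages passed, so "works without an error" then covers the kernel's execution as well as its launch.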

Is there any option to force Nsight to stop at all breakpoints?

How can I add a thread’s registers to my watch list?

Any help would be appreciated

I can post my code on request.

Cheers

P.S. How can I move this post to the Nsight forum? For some reason I can’t post messages there.

Ailleur · May 31, 2012, 12:41am

This may be basic, but are you launching your application through Nsight -> Start CUDA Debugging?

TripleS · May 31, 2012, 5:08am

Yep, sure, that’s what I’m doing.

I just updated my post; please look at my screenshots.

Gilles_C · May 31, 2012, 6:15am

Hi,
I’m not sure of the specifics of your environment or the particularities of GPU debugging, but generally speaking, a debugger can only honour breakpoints at actual code statements, and a variable declaration is not such a statement. Declarations are simply syntactical sugar that the programming language requires; they are not translated into machine code.
So apparently the debugger lets you set meaningless breakpoints before running the code and just discards them while running. The problem I see here is that those breakpoints shouldn’t have been allowed in the first place. It would be like allowing a breakpoint in a comment: there is not much of a hook to hang on to when the code runs.

TripleS · May 31, 2012, 6:25am

Hey @Gilles_C, thanks for the help.

I totally agree that a declaration is not treated as code by the compiler. The C++ compiler goes through the code twice: first to declare the variables in scope, then to turn the code into machine code.

To be honest, though, an assignment does count as code for the compiler, just like the statement x = 5;, and Nsight doesn’t stop on code that works with registers. It seems to me that Nsight only stops on the block’s variables.

S

Gilles_C · May 31, 2012, 7:52am

Right, sorry, I didn’t check to the end of the kernel.
Nonetheless, if your kernel is exactly what the screenshots show, neither “posss” nor “testDetctoinRate1” is actually used anywhere: posss can be optimised out to its assigned value, which is the index for accessing “pGPUsumMat”, and testDetctoinRate1 stores the corresponding value without using it. So the compiler is free to optimise all of that code out. In fact, the cuda-gdb documentation says that “-g -G” “forces -O0 compilation, with the exception of very limited dead-code eliminations and register-spilling optimizations”. I suspect you’re in the “dead-code elimination” case. Just try to pretend to do something with the variables to prevent the compiler from removing the statements…
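As an illustration of "pretending to do something with the variables", here is a minimal sketch that writes the intermediate values to an extra output buffer so that they have an observable effect. It is an assumption-laden example, not the kernel from the screenshots: checkSketch, runCheckSketch, debugOut, pos and value are all hypothetical names, and whether this restores a particular breakpoint in a given Nsight version is not guaranteed.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical kernel, loosely modelled on the one described in the thread.
__global__ void checkSketch(int rows, int cols, const unsigned int *data,
                            unsigned int *debugOut)
{
    int pos = threadIdx.x;              // could be folded away if never used
    if (pos >= rows * cols)
        return;

    unsigned int value = data[pos];     // a load whose result is otherwise unused

    // Storing both values to global memory makes them observable, so the
    // compiler cannot treat the assignments above as dead code.
    debugOut[2 * pos]     = (unsigned int)pos;
    debugOut[2 * pos + 1] = value;
}

// Host helper: runs the kernel on a small matrix (rows * cols <= 32, one
// block of 32 threads) and copies the debug values back into out, which
// must hold 2 * rows * cols elements. Returns true on success.
bool runCheckSketch(int rows, int cols, const unsigned int *in,
                    unsigned int *out)
{
    size_t n = (size_t)rows * cols;
    unsigned int *dIn = NULL, *dOut = NULL;
    if (cudaMalloc((void **)&dIn, n * sizeof(unsigned int)) != cudaSuccess)
        return false;
    if (cudaMalloc((void **)&dOut, 2 * n * sizeof(unsigned int)) != cudaSuccess) {
        cudaFree(dIn);
        return false;
    }
    bool ok = cudaMemcpy(dIn, in, n * sizeof(unsigned int),
                         cudaMemcpyHostToDevice) == cudaSuccess;
    if (ok) {
        checkSketch<<<1, 32>>>(rows, cols, dIn, dOut);
        ok = cudaGetLastError() == cudaSuccess &&
             cudaMemcpy(out, dOut, 2 * n * sizeof(unsigned int),
                        cudaMemcpyDeviceToHost) == cudaSuccess;
    }
    cudaFree(dIn);
    cudaFree(dOut);
    return ok;
}
```

With the stores in place, the lines assigning pos and value correspond to instructions the compiler has to emit, which gives a breakpoint something to bind to. The debug buffer can be removed again once the investigation is over.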

TripleS · June 1, 2012, 4:06pm

Thanks for the help.

Initially, my debug command line was as follows:

Runtime API (NVCC Compilation Type is hybrid object or .c file)

set CUDAFE_FLAGS=--sdk_dir "c:\Program Files\Microsoft SDKs\Windows\v7.0A"

"C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v4.2\bin\nvcc.exe" --use-local-env --cl-version 2010 -ccbin "C:\Program Files\Microsoft Visual Studio 10.0\VC\bin" -I"…....\opencv\modules\gpu\src\opencv2\gpu\device" -I"…....\opencv\modules\gpu\include\opencv2\gpu" -I"…....\build\include\" -G --keep-dir "Debug" -maxrregcount=0 --machine 32 --compile -g -Xcompiler "/EHsc /nologo /Od /Zi /MDd " -o "Debug%(Filename)%(Extension).obj" "%(FullPath)"

I then changed Property Pages → CUDA → Host → Generate Host Debug Information to No.

Now my command line contains neither -g nor -O; it is as follows:

Runtime API (NVCC Compilation Type is hybrid object or .c file)

set CUDAFE_FLAGS=--sdk_dir "c:\Program Files\Microsoft SDKs\Windows\v7.0A"

"C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v4.2\bin\nvcc.exe" --use-local-env --cl-version 2010 -ccbin "C:\Program Files\Microsoft Visual Studio 10.0\VC\bin" -I"…....\opencv\modules\gpu\src\opencv2\gpu\device" -I"…....\opencv\modules\gpu\include\opencv2\gpu" -I"…....\build\include\" -G --keep-dir "Debug" -maxrregcount=0 --machine 32 --compile -Xcompiler "/EHsc /nologo /Od /Zi /MDd " -o "Debug%(Filename)%(Extension).obj" "%(FullPath)"

However, I still build with -o; does that matter?

The change doesn’t make any difference.
