Potential block size bug

First off, I am completely new to parallel programming. For an MPI class I am taking, we wrote a program to calculate pi by finding the area under a curve using the midpoint rule. I decided to do the same in CUDA to compare results with the 64-node Beowulf cluster we have at school.

I set the block size to (512, 1, 1) with a grid size of (1, 1, 1) just to get started. Everything worked fine, computing the result in 0.3 ms with 1,000,000 panels. But then, while changing the block size around, I accidentally set it to (513, 1, 1). The code still ran and posted a result in 0.05 ms.
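For context, here is a minimal sketch of the kind of kernel I mean. This is illustrative, not my exact code: it assumes the classic 4/(1+x^2) integrand on [0,1] (whose integral is pi), and names like midpointPi are made up.

```
// Sketch: midpoint-rule estimate of pi with one block of 512 threads.
#include <cstdio>
#include <cuda_runtime.h>

#define NUM_PANELS 1000000
#define BLOCK_SIZE 512   // hardware limit: 512 threads per block

__global__ void midpointPi(float *result, int panels)
{
    __shared__ float cache[BLOCK_SIZE];
    float width = 1.0f / panels;
    float sum = 0.0f;

    // Each thread sums a strided subset of the panels.
    for (int i = threadIdx.x; i < panels; i += blockDim.x) {
        float x = (i + 0.5f) * width;            // midpoint of panel i
        sum += 4.0f / (1.0f + x * x) * width;    // panel area
    }
    cache[threadIdx.x] = sum;
    __syncthreads();

    // Tree reduction across the block's shared memory.
    for (int s = blockDim.x / 2; s > 0; s >>= 1) {
        if (threadIdx.x < s)
            cache[threadIdx.x] += cache[threadIdx.x + s];
        __syncthreads();
    }
    if (threadIdx.x == 0)
        result[0] = cache[0];
}

int main()
{
    float *d_result, h_result;
    cudaMalloc((void **)&d_result, sizeof(float));

    // Matches the launch described above: grid (1,1,1), block (512,1,1).
    midpointPi<<<1, BLOCK_SIZE>>>(d_result, NUM_PANELS);

    cudaMemcpy(&h_result, d_result, sizeof(float), cudaMemcpyDeviceToHost);
    printf("pi ~= %f\n", h_result);
    cudaFree(d_result);
    return 0;
}
```

A single block strides across all 1,000,000 panels and then reduces in shared memory, which is why only the block dimension matters here.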

So I saw a 6x speedup by going to a block size that isn't supported (513 exceeds the 512-threads-per-block limit). Shouldn't the compiler stop me from setting a block size of 513? And why did performance appear to increase?

System Information
CUDA Toolkit and SDK 1.0
XP Pro SP2, VS2005
AMD64 3700+, 512 MB
Quadro 4600

Any insight would be appreciated!

The kernel failed to launch. If you check the result, it will be wrong.

We are improving the error reporting/detection.
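In the meantime, you can detect a failed launch yourself with the runtime API. A sketch, reusing the hypothetical midpointPi kernel and d_result pointer from the code above in place of the launch line in main():

```
// Sketch: detecting a failed kernel launch with the runtime API.
midpointPi<<<1, 513>>>(d_result, NUM_PANELS);   // 513 > 512: launch fails

cudaError_t err = cudaGetLastError();           // catches the launch error
if (err != cudaSuccess)
    printf("launch failed: %s\n", cudaGetErrorString(err));

// Errors from inside the kernel only surface after synchronization.
err = cudaThreadSynchronize();
if (err != cudaSuccess)
    printf("kernel failed: %s\n", cudaGetErrorString(err));
```

With an invalid configuration the kernel never runs, so the 0.05 ms you measured is just the cost of the failed launch, not a computation.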

That’s what I figured; I just wanted to make sure NVIDIA knew this was happening.