Atomic operation Getting atomicAdd support

enjoyamalp · December 3, 2007, 11:59am

Hi,

I am using CUDA1.0 and i want to do some atomic operation in a memory location at global area. My display card is 8800GTS. I changed the custom build setup to

$(CUDA_BIN_PATH)\nvcc.exe -arch sm_11 -ccbin “$(VCInstallDir)bin” -c -DWIN32 -D_CONSOLE -D_MBCS -Xcompiler /EHsc,/W3,/nologo,/Wp64,/O2,/Zi,/MT -I"$(CUDA_INC_PATH)" -I./ -I…/…/common/inc -o $(ConfigurationName)\template.obj template.cu

But the program is giving some different output than what i expect.

kernel
global void
testKernel( int* g_odata)
{
// Block index
int bx = blockIdx.x;
int by = blockIdx.y;

// Thread index
int tx = threadIdx.x;
int ty = threadIdx.y;
if(tx==0&&bx==0)
{
  g_odata[0]=0;
}
int nBlocksize = 16;
int nStart = bx * ceil((float)65536/nBlocksize)  + tx * ceil((float)((65536/nBlocksize)/nBlocksize));
for( int i = nStart; i <= nStart+ceil((float)((65536/nBlocksize)/nBlocksize)); i=i+1 )
{
   g_odata[0] = 1.0f; 
   //__syncthreads();
}

}

host
void
runTest( int argc, char** argv)
{
CUT_DEVICE_INIT();
int* pCpuOutData = (int*)malloc( 256256sizeof(float));
int* pOutData;
CUDA_SAFE_CALL( cudaMalloc( (void**) &pOutData, 256 * 256 * sizeof(int)));
dim3 grid(16,1);
dim3 thread(16,1);
testKernel<<<grid,thread>>>(pOutData);

CUDA_SAFE_CALL( cudaMemcpy( pCpuOutData, pOutData, 256 * 256 * sizeof(int),
                cudaMemcpyDeviceToHost) );    



printf("%d",pCpuOutData[0]);
CUDA_SAFE_CALL( cudaFree(pOutData));   
free( pCpuOutData );

}

The output is some junk value like 11731320…

Please help me.

wumpus · December 3, 2007, 12:06pm

Alas, the 8800GTS doesn’t support atomic operations, you need a card with compute model 1.1 for that, like the 8600 (G86) or 8800GT (G92)

enjoyamalp · December 3, 2007, 12:07pm

is there any table which specifies the card number like 1.1 or 1.0? if so where?

Simon_Green · December 3, 2007, 1:49pm

The CUDA FAQ includes a list of supported GPUs and their compute version:
[url=“http://forums.nvidia.com/index.php?showtopic=36286”]http://forums.nvidia.com/index.php?showtopic=36286[/url]

We will be updating this for CUDA 1.1 shortly.

Topic		Replies	Views
atomic add CUDA Programming and Performance	4	4714	March 20, 2008
atomic function for 8600 and not 8800 ? for 1.1 and not 1.0 ??? CUDA Programming and Performance	1	4710	July 16, 2007
Atomic Operations on GTX 280 ? CUDA Programming and Performance	2	899	March 4, 2010
Can we use "AtomicAdd()" with GTX 8800? Any other option to do same thing...? CUDA Programming and Performance	1	2247	December 7, 2007
AtomicAdd algorithm CUDA Programming and Performance	7	3962	August 25, 2009
Atomic op available on Telsa or not? CUDA Programming and Performance	1	3765	July 11, 2007
Does Tesla support atomic operations? CUDA Programming and Performance	6	6308	April 25, 2008
atomic add for GTS 8800? CUDA Programming and Performance	3	2793	December 30, 2007
Does GTX285 support atomic operations? CUDA Programming and Performance	1	3064	October 6, 2009
AtomicAdd CUDA Programming and Performance	5	7904	April 4, 2008

Atomic operation Getting atomicAdd support

Related topics