why does this code fail at assertion?

BigJoe1 · November 15, 2008, 8:11pm

this code fails when N >= 4096, it works fine at 2048 and lower multiples of 256, i just dont understand why this is happening.
can somebody shed some light on the matter?
thanks in advance

#include <stdio.h>
#include <math.h>
#include <assert.h>
#include <cuda.h>

global void addOne(int* A, int N)
{
int i = blockIdx.x*blockDim.x + threadIdx.x;
if (i < N) A[i] += 1;
}

int main()
{
int N = 4096;
int i = 0;
int* h_a; //host array
int* d_a; //device array

h_a = (int*)malloc(N*sizeof(int));
cudaMalloc( (void**) &d_a, N);

for (i = 0; i < N; i++) h_a[i] = 0;

cudaMemcpy(d_a, h_a, sizeof(int)*N, cudaMemcpyHostToDevice);

double numThreadsPerBlock = 256;
double numBlocks = N / numThreadsPerBlock;

addOne<<<numThreadsPerBlock, numBlocks>>> (d_a, N);

cudaMemcpy(h_a, d_a, sizeof(int)*N, cudaMemcpyDeviceToHost);

for (i = 0; i < N; i++) assert(h_a[i] == 1);

system("pause");

return 0;

}

alex_dubinsky · November 15, 2008, 10:15pm

cudaMalloc( (void**) &d_a, N);
should be
cudaMalloc( (void**) &d_a, N*sizeof(int));
:ph34r: NEXT

BigJoe1 · November 16, 2008, 12:10am

omg cant believe i overlooked that … thanks

Topic		Replies	Views
Why does my code gives wrong output when number of blocks exceed 1 CUDA Programming and Performance	1	257	June 2, 2023
basic matrix addition CUDA Programming and Performance	3	1870	March 9, 2012
Result of simple vector summation is not correct. CUDA Programming and Performance	2	779	July 23, 2013
PLease debug the code! CUDA Programming and Performance	2	438	July 15, 2011
invalid configuration argument error CUDA Programming and Performance	2	1636	June 3, 2015
CUDA gives wrong result for large number of points/block CUDA Programming and Performance	3	2717	February 25, 2009
CUDA Vector addition error. CUDA Programming and Performance	3	903	January 31, 2017
strange behavior of data size in cudaMalloc or cudaMemcpy CUDA Programming and Performance	2	4944	February 9, 2009
Failure on assert CUDA Programming and Performance	4	1279	February 22, 2020
Beginner at Cuda seg faulting CUDA Programming and Performance	0	430	August 31, 2016

why does this code fail at assertion?

Related topics