cudaMemcpy max size in bytes?

When I have 0xffffffff floats in an array, cudaMemcpy fails. Does it have a maximum size that it's allowed to copy? cudaMalloc is able to handle that size but not cudaMemcpy?
Thanks in advance

int main(void)
{
float *a_h, *b_h; // pointers to host memory
float *a_d;       // pointer to device memory
int i, N = 0xffffffff;

// allocate arrays on host
a_h = (float*)malloc(sizeof(float)*N);
b_h = (float*)malloc(sizeof(float)*N);

//allocate array on device
CUDA_SAFE_CALL(cudaMalloc((void**)&a_d, sizeof(float)*N));

// initialization of host data
for (i=0; i<N; i++) a_h[i] = (float)i;

//copy data from host to device
CUDA_SAFE_CALL(cudaMemcpy(a_d, a_h, sizeof(float)*N, cudaMemcpyHostToDevice));

// do calculation on host
incrementArrayOnHost(a_h, N);

//check assert to see if we get results expected
for (i=0; i<N; i++) assert(a_h[i] == i+1);

/*
// do calculation on device:
// Part 1 of 2. Compute execution configuration
int blockSize = 4;
int nBlocks = N/blockSize + (N%blockSize == 0?0:1);

// Part 2 of 2. Call incrementArrayOnDevice kernel
incrementArrayOnDevice <<< nBlocks, blockSize >>> (a_d, N);

// Retrieve result from device and store in b_h
cudaMemcpy(b_h, a_d, sizeof(float)*N, cudaMemcpyDeviceToHost);

// check results
for (i=0; i<N; i++) assert(a_h[i] == b_h[i]);

*/

// cleanup
free(a_h); free(b_h); CUDA_SAFE_CALL(cudaFree(a_d));

system("pause");

}

wait, how are you going to copy 16 gigs worth of floats anywhere…?

Even if you wanted to copy 16 GB of data, the expression sizeof(float)*N will not give you that number of bytes, because N is really a signed integer holding -1.
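
For comparison, here's a rough sketch (untested, and the element count is just a placeholder I picked, not your 0xffffffff) of doing the size math in size_t and asking the runtime how much device memory is actually free before allocating:

#include <cstdio>
#include <cuda_runtime.h>

int main()
{
    size_t N = 100 * 1000 * 1000;          // placeholder element count, not 0xffffffff
    size_t bytes = N * sizeof(float);      // size_t arithmetic, no signed overflow

    size_t freeMem = 0, totalMem = 0;
    cudaMemGetInfo(&freeMem, &totalMem);   // how much device memory is actually available
    printf("need %zu bytes, device has %zu of %zu bytes free\n",
           bytes, freeMem, totalMem);

    float *a_d = NULL;
    if (bytes <= freeMem && cudaMalloc((void**)&a_d, bytes) == cudaSuccess) {
        // ... use a_d ...
        cudaFree(a_d);
    }
    return 0;
}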

Are you sure? What super GPU do you have with more than 16 GiB of memory?

Perhaps you are compiling in release mode, so the CUDA_SAFE_CALL you have around the cudaMalloc is silently ignoring the error it is almost certainly generating.
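
If you don't want to depend on CUDA_SAFE_CALL at all, a minimal sketch of checking the return codes yourself looks something like this (the check helper is just a name I made up for illustration):

#include <cstdio>
#include <cuda_runtime.h>

// print the error instead of silently dropping it
static void check(cudaError_t err, const char *what)
{
    if (err != cudaSuccess)
        fprintf(stderr, "%s failed: %s\n", what, cudaGetErrorString(err));
}

int main()
{
    float *a_d = NULL;
    size_t bytes = (size_t)1 << 20;                 // 1 MB, just for the demo
    check(cudaMalloc((void**)&a_d, bytes), "cudaMalloc");
    check(cudaMemset(a_d, 0, bytes), "cudaMemset");
    check(cudaFree(a_d), "cudaFree");
    return 0;
}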

Sorry guys, I meant to say cudaMalloc isn't able to work. Also, is it really trying to allocate 16 GB?

I think size_t is unsigned int on 32-bit platforms and unsigned long long int on 64-bit platforms. You're asking for roughly 4*2^30 floats multiplied by sizeof(float), i.e. about 4 billion times 4 bytes, so yeah, 16 gigs. Of course it doesn't work.

(ps I doubt your malloc calls are working either…)
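
A quick sketch to confirm that, using the same expression as in the original code:

#include <cstdio>
#include <cstdlib>

int main()
{
    int N = 0xffffffff;                      // same value as in the original code
    size_t bytes = sizeof(float) * N;        // becomes huge after the signed -> unsigned conversion

    float *a_h = (float*)malloc(bytes);
    if (a_h == NULL)
        printf("malloc of %zu bytes failed\n", bytes);   // expect to land here
    else
        free(a_h);
    return 0;
}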

64-bit compile:

#include <iostream>

int main()
{
    int x = 0xffffffff;
    std::cerr << (sizeof(float)*x) << "\n";
    return 0;
}

Output:

18446744073709551612

Imagine a GPU with so much memory, the decimal system is useless upon it…