I have a problem concerning memory allocation on the card. My main program allocates memory on the card for my input data and also copies the input data to the card's memory. The pointer I got from cudaMalloc (in the main program) is then passed as a parameter to a host thread that starts the kernel on the device. The problem is that the kernel reports an "invalid device pointer". If I move the cudaMalloc calls from the main program into the thread that launches the kernel, everything works fine.
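For reference, a minimal sketch of the pattern described above (using pthreads; the names `kernel`, `launch`, and the sizes are illustrative, not from the original post):

```cuda
#include <cuda_runtime.h>
#include <pthread.h>
#include <stdio.h>

__global__ void kernel(float *data) { data[0] = 1.0f; }

/* Worker thread: receives a device pointer that was allocated
 * by the main thread, i.e. in a different CUDA context. */
static void *launch(void *arg) {
    float *d_data = (float *)arg;
    kernel<<<1, 1>>>(d_data);  /* fails here: the pointer belongs to another context */
    printf("%s\n", cudaGetErrorString(cudaGetLastError()));
    return NULL;
}

int main(void) {
    float *d_data;
    /* Allocation happens in the main thread's context. */
    cudaMalloc((void **)&d_data, 256 * sizeof(float));

    pthread_t t;
    pthread_create(&t, NULL, launch, d_data);
    pthread_join(t, NULL);

    cudaFree(d_data);
    return 0;
}
```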
Are there any restrictions concerning cudaMalloc and different threads?
I assume that it is not possible; in my tests it doesn't work. It seems that every thread has its own, let's call it, "CUDA context". In one thread I used cuMemGetInfo() to query the memory information. The result was that my card (an 8800 GTX) reported 18MB, yet I was able to allocate 128MB with cudaMalloc(). Nice ;-)
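For anyone trying to reproduce this, a sketch of querying memory via the driver API (the modern cuMemGetInfo signature takes size_t pointers; very early CUDA releases used unsigned int):

```cuda
#include <cuda.h>
#include <stdio.h>

int main(void) {
    cuInit(0);
    CUdevice dev;
    CUcontext ctx;
    cuDeviceGet(&dev, 0);
    cuCtxCreate(&ctx, 0, dev);  /* context is tied to this host thread */

    size_t free_bytes, total_bytes;
    cuMemGetInfo(&free_bytes, &total_bytes);
    printf("free: %zu MB, total: %zu MB\n",
           free_bytes >> 20, total_bytes >> 20);

    cuCtxDestroy(ctx);
    return 0;
}
```

The numbers reported apply to the context current in the calling thread, which is why two threads can disagree.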
A CUDA context is like a CPU process, in that each has its own state, memory space, etc. There is a 1-to-1 correspondence between CPU threads and CUDA contexts, so you cannot have multiple threads sharing one context, and you cannot have multiple contexts per thread.
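Under that one-context-per-thread model, the fix the original poster found, doing the allocation in the same thread that launches the kernel, can be sketched like this (names are illustrative):

```cuda
#include <cuda_runtime.h>
#include <pthread.h>

__global__ void kernel(float *data) { data[0] = 1.0f; }

/* All CUDA work happens in one host thread, hence one context:
 * allocation, copy, and launch all see the same device pointers. */
static void *worker(void *arg) {
    const float *h_input = (const float *)arg;
    float *d_data;
    cudaMalloc((void **)&d_data, 256 * sizeof(float));
    cudaMemcpy(d_data, h_input, 256 * sizeof(float), cudaMemcpyHostToDevice);
    kernel<<<1, 1>>>(d_data);
    cudaThreadSynchronize();  /* pre-CUDA-4.0 spelling of cudaDeviceSynchronize */
    cudaFree(d_data);
    return NULL;
}

int main(void) {
    float h_input[256] = {0};
    pthread_t t;
    pthread_create(&t, NULL, worker, h_input);
    pthread_join(t, NULL);
    return 0;
}
```

(Note that this one-context-per-thread behavior is how early CUDA releases worked; since CUDA 4.0 the runtime API shares a single context per device across all host threads, so device pointers can be passed between threads.)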
This means that two host threads cannot see each other's CUDA arrays or data structures.