How to allocate memory in a __device__ function?

Hello!
Can I allocate any memory during GPU calculations, inside a device function?

Device functions have the same restrictions as global functions (so all global memory has to be allocated from your host code).
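
To illustrate the host-side route, here is a minimal sketch; the kernel name fill and the buffer size are just placeholders. The host allocates with cudaMalloc() and passes the pointer into the kernel, which only writes through it:

```
#include <cuda_runtime.h>
#include <cstdio>

// Hypothetical kernel: it fills the buffer it was given; it cannot
// allocate that buffer itself.
__global__ void fill(float *out, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) out[i] = (float)i;
}

int main() {
    const int n = 1024;
    float *d_out = NULL;
    cudaMalloc((void **)&d_out, n * sizeof(float));  // allocation happens on the host
    fill<<<(n + 255) / 256, 256>>>(d_out, n);
    cudaDeviceSynchronize();
    cudaFree(d_out);
    return 0;
}
```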

That's a problem, because my task involves allocating very large amounts of memory :(
For example, I need to perform operations on large matrices, and I don't know in advance how large the resultant matrix will be, i.e. I would need to constantly attach additional memory blocks to the resultant matrix. Do you know how to resolve this problem?

Either preallocate a very large amount of GPU memory, or figure out how much memory you're going to need in one kernel, return the sizes to the host, allocate the appropriate amount of memory, and then launch a second kernel (see the sketch below).
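
A rough sketch of the two-kernel approach, under assumptions: the kernel names countOutputs/writeOutputs are made up, and the per-element size rule (input[i] % 4) is a stand-in for whatever your real matrix logic computes. Pass 1 only measures, the host turns counts into offsets and allocates exactly enough, pass 2 writes:

```
#include <cuda_runtime.h>
#include <cstdio>
#include <vector>

// Pass 1 (hypothetical): each thread reports how many output elements
// it will produce, without writing any of them yet.
__global__ void countOutputs(const int *input, int n, int *counts) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) counts[i] = input[i] % 4;   // stand-in for the real size logic
}

// Pass 2 (hypothetical): each thread writes its elements at the offset
// the host computed from the counts.
__global__ void writeOutputs(const int *input, int n,
                             const int *offsets, int *out) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) {
        int base = offsets[i];
        for (int k = 0; k < input[i] % 4; ++k)
            out[base + k] = input[i];
    }
}

int main() {
    const int n = 1 << 10;
    std::vector<int> h_in(n);
    for (int i = 0; i < n; ++i) h_in[i] = i;

    int *d_in, *d_counts;
    cudaMalloc(&d_in, n * sizeof(int));
    cudaMalloc(&d_counts, n * sizeof(int));
    cudaMemcpy(d_in, h_in.data(), n * sizeof(int), cudaMemcpyHostToDevice);

    // Kernel 1: measure only.
    countOutputs<<<(n + 255) / 256, 256>>>(d_in, n, d_counts);

    // Return the counts to the host; turn them into offsets and a total.
    std::vector<int> h_counts(n), h_offsets(n);
    cudaMemcpy(h_counts.data(), d_counts, n * sizeof(int), cudaMemcpyDeviceToHost);
    int total = 0;
    for (int i = 0; i < n; ++i) { h_offsets[i] = total; total += h_counts[i]; }

    // Allocate exactly what is needed, then launch kernel 2.
    int *d_offsets, *d_out;
    cudaMalloc(&d_offsets, n * sizeof(int));
    cudaMalloc(&d_out, total * sizeof(int));
    cudaMemcpy(d_offsets, h_offsets.data(), n * sizeof(int), cudaMemcpyHostToDevice);
    writeOutputs<<<(n + 255) / 256, 256>>>(d_in, n, d_offsets, d_out);
    cudaDeviceSynchronize();

    printf("total output elements: %d\n", total);
    cudaFree(d_in); cudaFree(d_counts); cudaFree(d_offsets); cudaFree(d_out);
    return 0;
}
```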

Preallocating a very large amount of GPU memory doesn't fit my case: it uses memory very inefficiently, and for very large matrices there may not be enough memory at all.

Returning values and launching another kernel takes a long time :( At that point a CPU implementation would be more effective.

Preallocating is a good solution. Instead of issuing multiple cudaMalloc() calls for all your arrays (whose sizes I'm guessing may change), issue a single, very large cudaMalloc() call. Then you can do anything you wish with that block of memory: split it up any way you like and allocate from it in device code (depending on your needs, you can even use global memory atomics to build a full, robust allocator). See the sketch below.
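
As a rough illustration of that last point, here is a minimal bump allocator over one big preallocated block. The names d_pool, d_poolOffset, and deviceAlloc are made up for this sketch; real code would round requests up for alignment and check for pool overflow:

```
#include <cuda_runtime.h>

// Base of the big cudaMalloc'd block and the number of bytes handed
// out so far; both live in device global memory.
__device__ char *d_pool;
__device__ unsigned long long d_poolOffset;

// atomicAdd gives each caller a private, non-overlapping slice of the
// pool. (A real allocator would align 'bytes' and guard against
// running past the end of the pool.)
__device__ void *deviceAlloc(size_t bytes) {
    unsigned long long old = atomicAdd(&d_poolOffset, (unsigned long long)bytes);
    return (void *)(d_pool + old);
}

__global__ void demo(int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) {
        // Each thread grabs room for 4 ints from the shared pool.
        int *mine = (int *)deviceAlloc(4 * sizeof(int));
        for (int k = 0; k < 4; ++k) mine[k] = i;
    }
}

int main() {
    const size_t poolBytes = 64 << 20;   // one single, very large allocation
    char *pool;
    cudaMalloc(&pool, poolBytes);

    // Point the device-side globals at the block we just allocated.
    unsigned long long zero = 0;
    cudaMemcpyToSymbol(d_pool, &pool, sizeof(pool));
    cudaMemcpyToSymbol(d_poolOffset, &zero, sizeof(zero));

    demo<<<4, 256>>>(1024);
    cudaDeviceSynchronize();
    cudaFree(pool);
    return 0;
}
```

Because each atomicAdd returns a distinct starting offset, threads never hand out overlapping slices. This sketch never frees individual slices; to reuse the pool, reset d_poolOffset to zero between launches.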