Hi,
This is probably a rhetorical question, but… is there a way to control where cudaMalloc allocates data relative to offset zero in GPU RAM?
The problem is that I have very large arrays to allocate on the GPU. Consider the following scenario:
Allocate 1.5 GB for pointerA.
Allocate 700 MB for pointerB.
Allocate 700 MB for pointerC.
Allocate 700 MB for pointerD.
Allocate assorted small pointers.
For a C1060 (4 GB of device memory) that should fit; however, depending on where the arrays land in the address space, it might fail.
Is there a way to ensure this fits into memory, other than making the arrays smaller by splitting them into chunks?
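For concreteness, here is a minimal sketch of the allocation pattern above, with error checking. The sizes are the ones from my list; whether the sequence succeeds depends entirely on how the driver lays the blocks out:

```cpp
// Sketch of the allocation sequence; any single cudaMalloc can fail
// if the remaining free space is fragmented, even when the total fits.
#include <cstdio>
#include <cuda_runtime.h>

static void* tryAlloc(const char* name, size_t bytes)
{
    void* p = nullptr;
    cudaError_t err = cudaMalloc(&p, bytes);
    if (err != cudaSuccess) {
        printf("%s (%zu MB) failed: %s\n", name, bytes >> 20,
               cudaGetErrorString(err));
        return nullptr;
    }
    printf("%s allocated at %p\n", name, p);
    return p;
}

int main()
{
    const size_t MB = 1 << 20;
    void* a = tryAlloc("pointerA", 1536 * MB); // 1.5 GB
    void* b = tryAlloc("pointerB",  700 * MB);
    void* c = tryAlloc("pointerC",  700 * MB);
    void* d = tryAlloc("pointerD",  700 * MB);
    // ... plus the assorted small allocations.
    return 0;
}
```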
I have come to the conclusion that the most reliable way to get this done is to allocate every last byte of free memory (or at least as much as your “big” storage needs require) on the device in an initialization stage at the beginning of the code, and then manage the division of that initial allocation into chunks yourself. The card/driver maintains a number of different page sizes, which can result in all sorts of odd fragmentation and “lost” memory, to the point where a single-byte allocation in what appears to be “high” memory space actually consumes a complete 64 KB page.
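As a rough illustration of that approach, here is a minimal sketch: one big cudaMalloc at startup, then a trivial bump allocator on top of it. GpuArena and the 64 MB reserve left for the driver are my own inventions; a real version would need proper free-list management rather than a bump pointer:

```cpp
// Grab (almost) all free device memory once, then suballocate it.
#include <cstddef>
#include <cuda_runtime.h>

struct GpuArena {
    char*  base   = nullptr;
    size_t size   = 0;
    size_t offset = 0;

    // Reserve everything the driver will give us, minus some headroom.
    bool init(size_t reserve = 64 << 20)
    {
        size_t freeB = 0, totalB = 0;
        cudaMemGetInfo(&freeB, &totalB);
        if (freeB <= reserve) return false;
        size = freeB - reserve;
        return cudaMalloc(reinterpret_cast<void**>(&base), size)
               == cudaSuccess;
    }

    // Hand out 256-byte-aligned chunks from the single big block.
    void* alloc(size_t bytes)
    {
        size_t aligned = (bytes + 255) & ~size_t(255);
        if (offset + aligned > size) return nullptr;
        void* p = base + offset;
        offset += aligned;
        return p;
    }

    void destroy() { cudaFree(base); base = nullptr; }
};
```

Every pointer handed out this way is just an offset into the one allocation, so fragmentation is entirely under your own control.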
I’ve seen plenty of custom memory allocators for the CPU, but none for the GPU so far. The main difficulty in writing one is that it has to provide allocation routines for the different kinds of memory: linear 1D, 2D pitched, 3D, CUDA arrays, texture, constant memory…!
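To illustrate the point, here is a hypothetical front-end (all function names are mine) showing how each kind of memory has its own CUDA entry point and return shape, which is what makes a single unified allocator awkward:

```cpp
// Each memory kind returns something different: a raw pointer, a
// pointer plus a driver-chosen pitch, or an opaque cudaArray*.
// Error checking omitted for brevity.
#include <cuda_runtime.h>

void* allocLinear(size_t bytes)                        // plain 1D
{
    void* p = nullptr;
    cudaMalloc(&p, bytes);
    return p;
}

void* allocPitched(size_t w, size_t h, size_t* pitch)  // 2D; driver picks pitch
{
    void* p = nullptr;
    cudaMallocPitch(&p, pitch, w, h);
    return p;
}

cudaArray* allocArray(size_t w, size_t h)              // for texture binding
{
    cudaChannelFormatDesc d = cudaCreateChannelDesc<float>();
    cudaArray* a = nullptr;
    cudaMallocArray(&a, &d, w, h);
    return a;
}

// Constant memory cannot be allocated at run time at all; it is
// declared statically with __constant__, so a custom allocator
// cannot manage it.
```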