CUDA fails to allocate large chunk of memory

charlesaverill20 · March 23, 2022, 3:33pm

I’d like to allocate a big block of memory (roughly 33 MB) on the device. I’ve been using malloc because this code runs only once and doesn’t need to be performant, but I’ve tested this with cudaMalloc as well.

canvas->antialiasing_samples = DO_ANTIALIASING ? ANTIALIASING_SAMPLES : 1;
int size_multiplier = canvas->antialiasing_samples * canvas->width * canvas->height;
// Compute the request size once; cast for printf because sizeof yields a size_t, not an int
size_t bytes = sizeof(Vector<int> *) * size_multiplier;
printf("%llu\n", (unsigned long long)bytes); // 33554432
canvas->antialiasing_colors_array = (Vector<int> **)malloc(bytes);
// Print the pointer with %p rather than %x
printf("%p\n", (void *)canvas->antialiasing_colors_array); // null pointer: the allocation failed

Setting size_multiplier to a smaller number in the kilobytes range works just fine. Is there some upper limit on malloc calls, or is there something else prohibiting me from allocating this much memory? I’m on an RTX 2070 with 8GB of memory, so this seems unusual to me.

Robert_Crovella · March 23, 2022, 3:43pm

Yes, in-kernel malloc (or in-kernel new, or in-kernel cudaMalloc) is limited to the size of the device heap, which is adjustable. This is a commonly asked question, so you can find many write-ups on it, but the documentation covers everything you need to know. I recommend reading all of that section (B.33), including all of the sub-sections (B.33.1, B.33.2, etc.).
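For illustration, here is a minimal sketch of raising the device heap limit before launching a kernel that calls malloc. The default heap is 8 MB, so the 33554432-byte (32 MiB) request from the question cannot fit. The kernel `try_alloc`, the helper `device_malloc_succeeds`, and the choice of a heap twice the request size are assumptions made for this example, not part of the original code; `cudaDeviceSetLimit` with `cudaLimitMallocHeapSize` is the runtime call that adjusts the heap.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical kernel: a single thread attempts an in-kernel malloc of
// `bytes` and records whether it succeeded.
__global__ void try_alloc(size_t bytes, int *ok)
{
    void *p = malloc(bytes);      // served from the device heap
    *ok = (p != nullptr) ? 1 : 0;
    if (p != nullptr) {
        free(p);                  // return the block to the device heap
    }
}

// Hypothetical helper: set the device heap size, then test one allocation.
// The limit has to be set before any kernel that uses malloc is launched.
bool device_malloc_succeeds(size_t heap_bytes, size_t alloc_bytes)
{
    if (cudaDeviceSetLimit(cudaLimitMallocHeapSize, heap_bytes) != cudaSuccess) {
        return false;
    }

    int *ok_dev = nullptr;
    if (cudaMalloc(&ok_dev, sizeof(int)) != cudaSuccess) {
        return false;
    }

    try_alloc<<<1, 1>>>(alloc_bytes, ok_dev);

    // cudaMemcpy waits for the kernel to finish before copying the flag.
    int ok = 0;
    cudaError_t err = cudaMemcpy(&ok, ok_dev, sizeof(int), cudaMemcpyDeviceToHost);
    cudaFree(ok_dev);
    return err == cudaSuccess && ok != 0;
}

int main()
{
    const size_t alloc_bytes = 33554432;        // request size from the question
    const size_t heap_bytes  = 2 * alloc_bytes; // assumed headroom, tune as needed

    bool ok = device_malloc_succeeds(heap_bytes, alloc_bytes);

    size_t current = 0;
    cudaDeviceGetLimit(&current, cudaLimitMallocHeapSize);
    printf("device heap: %zu bytes, in-kernel malloc %s\n",
           current, ok ? "succeeded" : "failed");
    return ok ? 0 : 1;
}
```

The heap is shared by every thread that calls malloc in a kernel, so if many threads allocate at once the limit needs to cover the total, plus some allocator overhead. For a single large buffer that is allocated once, a host-side cudaMalloc whose pointer is passed to the kernel avoids the heap limit altogether.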

