How to solve memory allocation problem in cuda??

anik · February 1, 2015, 4:23pm

Fatal error: Failed to allocate device buffer. (out of memory at …/src/programname:linenumber

My 3D array is 20 X 200 X 200 and for each value in an array it returns 1331 outcomes (one for location and one for difference). Hence, I have to pass total 3 arrays to GPU of which one is of size 20 X 200 X 200 and other two are 20 X 200 X 200 X 1331.

So, I think this much memory allocation is not possible in GPU memory. So, is there any other way to handle this problem???

I have searched in the internet and couldn’t find any satisfactory solution. Here I would like to mention that I am using CUDA 6 version, Linux 64 bit operating system with 64 GB Ram support and 2TB hard disk support. I have checked memory status and it shows only 3% is in use. I have used CUDAMalloc function to allocate memory. But still this problem arises. So, anybody if could help me out in this regard will be very much helpful. Thanks in advance.

inJeans · September 25, 2015, 10:41am

If we assume your arrays are 32bit floats one of your four dimensional matrices would require

20*200*200*1331*32bit = 4.3 GB

So it really depends on the hardware you are using. Newer Tesla cards have between 12-24 GB of memory, while older cards might only have 2-4GB.

The common approach to getting around this problem is to break it up into batches. If, as you say, you are simply looking for differences, you could pass in small sections of your array at a time and operate on each section individually.

If your dataset is too large to fit on the device, there isn’t a lot more you can do.

Topic		Replies	Views
How to solve memory allocation problem in cuda?? CUDA Programming and Performance	4	30230	February 2, 2015
CUDA memory allocation problem CUDA Programming and Performance	1	480	July 22, 2016
GPU Allocating memory Memory allocation on GPU CUDA Programming and Performance	2	4641	April 23, 2009
How to solve memory allocation problem in cuda?? CUDA Setup and Installation	0	571	February 1, 2015
cudaMalloc3DArray out of memory can not allocate the available amount of memory CUDA Programming and Performance	3	1808	January 31, 2011
How is 4GB addressable on 32bit? CUDA Programming and Performance	10	9227	August 21, 2009
Cannot allocate "all" memory? cudaMalloc fails with 50MB memory left.. CUDA Programming and Performance	9	9571	July 15, 2008
How much GPU memory can cudaMalloc get? CUDA Programming and Performance	17	15088	April 2, 2022
Maximum memory allocation size CUDA Programming and Performance	7	16390	January 24, 2012
maximum allocation size on windows 7 CUDA Programming and Performance	2	11242	April 1, 2010

How to solve memory allocation problem in cuda??

Related topics