CUDA 6: Simplest Sample Segmentation Fault

sanek_dampir · February 15, 2014, 12:46pm

I get access to CUDA 6 RC as register developer and I want try to use new feature of CUDA 6: Unified Memory. So, I created simple example when I try use this feature: Here is me example:

#include <stdio.h>
#include <cuda_runtime.h>

int
main(void)
{
int numElements = 5000;
size_t size = numElements * sizeof(float);
float *a;
cudaMallocManaged(&a, numElements);

for (int i = 0; i < numElements; ++i)
 {
     a[i] = rand()/(float)RAND_MAX;
}


return 0;

}
I tried run it example, but I got segmentation fault error:

Segmentation fault: 11

Question - what I doing wrong?

adamjmac · February 15, 2014, 7:30pm

I have the same problem. cudaMallocManaged returns and the pointer is NULL. I have tried with -arch=sm_20, 30, and 35, and I am using a GTX780. I also cannot find any official documentation for this function.

Edit: I found the documentation included in the toolkit installer. It states:
Unified Memory has three basic requirements:

a GPU with SM architecture 3.0 or higher (Kepler class or newer)
a 64-bit host application and operating system, except on Android
Linux or Windows

I’m running on Windows in 32-bit mode so that’s probably it.

adamjmac · February 15, 2014, 9:53pm

Yeah, I tried in 64-bit mode and that solved the issue.

SPWorley · February 16, 2014, 2:52am

Your malloc is still wrong… it’s allocating number of elements, not the byte size. It should be

cudaMallocManaged(&a, size);

And you should also check for success to be pedantic.

aimjwizards · March 20, 2014, 6:35pm

I have problem with cudaMallocManaged() as well.

if I use cudaCheckErrors(cudaMallocManaged(&a, size)), the return error is unknown 71
After cudaMallocManaged(), the value of a is NULL.
My machine has two Tesla K20c, and is a 64-bit Linux host.
The CUDA driver version is NVIDIA-SMI 331.49 Driver Version: 331.49
Even the SDK examples that use cudaMallocManaged() didn’t go through.

samples/0_Simple/UnifiedMemoryStreams/UnifiedMemoryStreams.cu

samples/7_CUDALibraries/conjugateGradientUM/main.cpp:

Is there anyone who has encountered similar problems?

Robert_Crovella · March 21, 2014, 2:33am

What is your OS?

aimjwizards · March 21, 2014, 5:41pm

$ uname -a
Linux ***.edu 2.6.32-431.5.1.el6.x86_64 #1 SMP Fri Jan 10 14:46:43 EST 2014 x86_64 x86_64 x86_64 GNU/Linux

aimjwizards · March 24, 2014, 9:07pm

Hi,

It’s RHEL 6.

Elaheh · March 26, 2015, 4:13pm

I have the same problem. cudaMallocManaged() returns NULL after some successful allocation.
Is there any memory limitation on cudaMallocManaged except global memory size? I am just allocating some MBs and then get null pointer!

Elaheh · March 26, 2015, 4:52pm

I am using Tesla K20c.

Elaheh · March 26, 2015, 5:09pm

.