cuda malloc managed fails

trinayan · May 18, 2017, 3:12pm

Hi,

I am trying to run some applications that use cuda malloc managed. and I get an error that is not supported. What is the issue? Pascal should support this as per nvidia documentation

trinayan · May 18, 2017, 3:33pm

Maybe I am compiling with wrong arch sm flag and it is not generating the correct binary. What is the arch sm flag for jetson tx2?

kayccc · May 19, 2017, 2:20am

Hi trinayan

For TX2 Pascal GPU architecture, the Compute Capability is 6.2.

Thanks

NataljaNeumann · November 16, 2018, 4:18pm

Hello,

I’m trying to build and run the first CUDA application from An Even Easier Introduction to CUDA | NVIDIA Technical Blog

I have an MX150 GPU, in combination with Intel GPU. Windows 10 told the signature of the installed driver would be incorrect, so I reverted to the previous Windows driver, but this only as a side note.

Now I’m trying to build and run this first CUDA app, and I found that x and y are NULL after the execution of cudaMallocManaged. I changed the code of the main method to:

std::cout << "Test started " << std::endl;

int N = 1 << 20; // 1M elements

float *x, *y;
cudaMallocManaged(&x, N * sizeof(float));
cudaMallocManaged(&y, N * sizeof(float));

std::cout << "Test started2 " << (x!=NULL?"x present":"x missing") << "," << (y!=NULL?"y present":"y missing") << " 0x" << x <<" 0x" << y << std::endl;

// initialize x and y arrays on the host
for (int i = 0; i < N; i++) {
	x[i] = 1.0f;
	y[i] = 2.0f;
}

std::cout << "Values have been set" << std::endl;

// Run kernel on 1M elements on the GPU
add<<<1, 1>>>(N, x, y);
std::cout << "Add started on GPU " << std::endl;

// Wait for GPU to finish before accessing on host
cudaDeviceSynchronize();
std::cout << "Add finished on GPU " << std::endl;

// Check for errors (all values should be 3.0f)
float maxError = 0.0f;
for (int i = 0; i < N; i++)
	maxError = fmax(maxError, fabs(y[i] - 3.0f));
std::cout << "Max error: " << maxError << std::endl;

// Free memory
cudaFree(x);
cudaFree(y);

return 0;

The execution log:

Test started
Test started2 x missing,y missing 0x0000000000000000 0x0000000000000000

Any Ideas?

NataljaNeumann · November 17, 2018, 7:52am

Ok, I found that I need a new driver from the nvidia drivers page, not the one that was provided by the device manufacturer and not the one that comes with CUDA toolkit on the page CUDA Toolkit 11.7 Update 1 Downloads | NVIDIA Developer

With the WHQL driver it works. Just wanted to tell you…

Topic		Replies	Views
cudaMallocManaged error on my machine CUDA Programming and Performance	3	3899	October 23, 2014
Calling cudaMallocManaged always returns null, but only if a cu file CUDA Programming and Performance	2	794	November 9, 2017
cudaMallocManaged() not working CUDA Programming and Performance	1	2390	November 18, 2018
cudaMallocArray and 9800GTX CUDA Programming and Performance	0	2296	June 17, 2008
CUDA 6: Simplest Sample Segmentation Fault CUDA Programming and Performance	10	5093	March 26, 2015
Problem report cudaMalloc() returning "cudaErrorUnknown" CUDA Programming and Performance	16	8970	January 22, 2009
bug in CUDA initialization? simple code cant see the device after xxx runs CUDA Programming and Performance	10	7858	June 23, 2009
unspecified driver error in cudaMalloc CUDA Programming and Performance	0	2895	February 25, 2008
how can I check if a GPU supports managed memory model? CUDA Programming and Performance	1	871	December 15, 2018
Errors trying to run the samples "cudaMalloc failed" CUDA Programming and Performance	3	3882	February 9, 2009

cuda malloc managed fails

Test started Test started2 x missing,y missing 0x0000000000000000 0x0000000000000000

Related topics

Test started
Test started2 x missing,y missing 0x0000000000000000 0x0000000000000000