Problem using zero-copy / mapped memory Cuda 2.2 beta

MJH22 · March 18, 2009, 3:00pm

Hi,

I’m trying to use the memory mapping feature of Cuda 2.2 beta. My code is:

#include “cuda/cutil.h”

[codebox]

int main(void) {

float4 *ptr_h, *ptr_d;

CUDA_SAFE_CALL( cudaSetDevice(0) );

CUDA_SAFE_CALL( cudaSetDeviceFlags( cudaDeviceMapHost ) );

CUDA_SAFE_CALL( cudaHostAlloc( (void**) &(ptr_h), sizeof(float4) * 30000, cudaHostAllocMapped | cudaHostAllocPortable ) );

CUDA_SAFE_CALL( cudaHostGetDevicePointer( (void**) &(ptr_d), ptr_h, 0 ) );

}

[/codebox]

This gives the error:

[codebox]

Cuda error in file ‘f.cu’ in line 10 : unspecified launch failure in prior launch.

[/codebox]

which is the HostGetDevicePointer() call.

I have the beta release 185 driver, a Red Hat 5.3 x86-64 system and a Tesla C1060.

Any ideas what I’m doing wrong?

Cheers,

Matt

MisterAnderson42 · March 18, 2009, 6:08pm

It probably won’t make a difference, but have you tried putting the setDeviceFlags before the setDevice call? The manual does say:

but I don’t really thing that cudaSetDevice is a “CUDA operation”…

I’m at a conference, otherwise I would try it out myself.

tmurray · March 18, 2009, 6:48pm

Will poke at this in the afternoon.

wumpus · March 18, 2009, 7:36pm

Cool, if you get this to work please report the speed gains you get compared to copying.

mfatica · March 18, 2009, 8:27pm

It works just fine for me ( 2.2 beta, RHEL4 64bit)

#include "cuda_runtime.h"

int main(void) {

  float4 *ptr_h, *ptr_d;

  int cudaError;

cudaSetDevice(0);

   cudaSetDeviceFlags( cudaDeviceMapHost );

   cudaError=cudaHostAlloc( (void**) &(ptr_h), sizeof(float4) * 30000, cudaHostAllocMapped | cudaHostAllocPortable );

   if (cudaError) printf ("Failed to allocate pinned memory \n");

cudaError=cudaHostGetDevicePointer( (void**) &(ptr_d), ptr_h, 0 );

   if (cudaError) printf ("Failed to get device pointer \n");

}

Compiled both with gcc ( gcc -I/usr/local/cuda/include bug.c -L/usr/local/cuda/lib -lcudart) or nvcc (nvcc bug.c).

BTW using cutil is usually a bad idea, the error checks will go away in release mode.

MJH22 · March 19, 2009, 12:09pm

Sage words: the CUDA_SAFE_CALL macro I was using (derived originally from cutil) was broken.

Your example - and mine with corrected macro - work just fine now, thanks!

Matt

Topic		Replies	Views
About cudaHostGetDevicePointer problem CUDA Programming and Performance	1	8973	March 30, 2009
mapped memory with C1060 CUDA Programming and Performance	3	643	October 25, 2010
can I use pinned memory? CUDA Programming and Performance	6	2698	September 21, 2009
cudaHostAlloc on 2.2 CUDA Programming and Performance	3	1154	June 9, 2009
device memory collision from cudaHostGetDevicePointer CUDA Programming and Performance	6	844	April 8, 2011
error in using cuda mapped memory a test program for mapped memory CUDA Programming and Performance	0	1005	April 20, 2010
cudaHostGetDevicePointer() not working? CUDA Programming and Performance	1	2441	July 9, 2009
Problem with canMapHostMemory, CUDA SDK 2.2 Erroneous reporting of canMapHostMemory in CUDA SDK 2.2 CUDA Programming and Performance	0	795	June 23, 2009
cudaHostAlloc --> invalid argument it works with fermi, not with 9800gtx CUDA Programming and Performance	5	6662	November 11, 2011
mapped memory with C1060 CUDA Programming and Performance	1	513	October 24, 2010

Problem using zero-copy / mapped memory Cuda 2.2 beta

Related topics