Problem CudaMallocHost

mons91 · July 14, 2015, 10:17am

Hi everybody,
trying to use the CudaMallocHost function I encountered a problem, that is, if I allocate Host memory using CudaMallocHost in my main and then I pass the variable to a function in a .cu file where I execute my Cudamemcpy the overall program crashes, but if I substitute the CudaMallocHost with a simple malloc everything goes fine…Do you have any idea to solve this problem?

Thanks!

nikkadim · July 14, 2015, 1:20pm

cudaError_t cudaMallocHost (void ** ptr, size_t size )

Check for cudaError_t, may be it’s not possible to allocate this amount of data in PINNED memory.

mons91 · July 14, 2015, 2:02pm

unfortunately the cudaMallocHost doesn’t return any error.

Robert_Crovella · July 14, 2015, 3:34pm

I guess the problem is in something you haven’t described. Based on your description, the following test case seems to work fine for me:

$ cat t842.cpp
#include <cuda_runtime.h>
#define DSIZE 100000
void cudaTest(int *data, size_t dsize);

int main(){

  int *data;
  cudaMallocHost(&data, DSIZE*sizeof(int));
  for (int i = 0; i < DSIZE; i++){data[i] = i;}
  cudaTest(data, DSIZE);
  return 0;
}


$ cat t842.cu
#include <stdio.h>

#define DSIZE 10
__global__ void test(int *data, size_t dsize){
  for (int i = 0; i < dsize; i++) printf("data[%d] = %d\n", i, data[i]);
}

void cudaTest(int *data, size_t dsize){

  int *d_data;
  cudaMalloc(&d_data, dsize*sizeof(int));
  cudaMemcpy(d_data, data, DSIZE*sizeof(int), cudaMemcpyHostToDevice);
  test<<<1,1>>>(d_data, DSIZE);
  cudaDeviceSynchronize();
}
$ nvcc -o t842 t842.cu t842.cpp
$ ./t842
data[0] = 0
data[1] = 1
data[2] = 2
data[3] = 3
data[4] = 4
data[5] = 5
data[6] = 6
data[7] = 7
data[8] = 8
data[9] = 9
$

allanmac · July 14, 2015, 3:47pm

@mons91, are you performing a DeviceToDevice memcpy?

If so, you would need to use cudaHostGetDevicePointer() to obtain a device pointer to the cudaMallocHost() allocated memory.

Topic		Replies	Views
cudaMallocHost How to use CUDA Programming and Performance	6	35697	April 26, 2012
Problem with cudaHostAlloc Problem with Memcpy CUDA Programming and Performance	6	2983	July 2, 2012
Should cudaMallocHost() need retry? CUDA Programming and Performance	4	1362	January 10, 2016
cudamallochost problem CUDA Programming and Performance	6	10922	March 10, 2011
Problems with CudaFreeHost CUDA Programming and Performance	3	1031	September 1, 2015
CUDA class - allocate memory using malloc (Dynamic Global Memory Allocation and Operations) CUDA Programming and Performance	3	3202	February 2, 2017
1st call to cudaMallocHost fails... ... but next calls are OK. (!?) CUDA Programming and Performance	1	6189	January 8, 2009
What is the correct way to use cudaMallocHost to create a local array representing the GPU data? CUDA Programming and Performance	3	78	August 20, 2024
Copy pinned-memory to cuda array crash CUDA Programming and Performance	1	3307	December 22, 2011
Invalid Argument after calling cudaMalloc on device but not host CUDA Programming and Performance	1	1840	March 30, 2017

Problem CudaMallocHost

Related topics