Can I use "operator new" in device code?

fastinvsqrt · April 12, 2015, 10:38pm

After some quick searching, it seems to me that I should be able to use “operator new” in device code if I am compiling with at least compute 2.0. However, I have tried with both 2.0 and 3.0, but I am still getting an error about calling a host function (“operator new”) from a device function. Am I doing something wrong, or can I just not use “new” and “delete” in device code?

Should I just use malloc and free in my device code? Or can I just do something like this?

__device__ void* operator new(size_t bytes)
{
    return malloc(bytes);
}

__device__ void operator delete(void* mem)
{
    free(mem);
}

Robert_Crovella · April 13, 2015, 2:10am

You can just use new and delete in device code. (Try it!)

You don’t need to provide those definitions for ordinary usage.

If you want to overload new for a specific purpose, you can do that also. An example is discussed here:

http://devblogs.nvidia.com/parallelforall/unified-memory-in-cuda-6/

(although that example is really host-side new)

Here’s a fully worked example of device new:

$ cat t716.cu
#include <stdio.h>
#define DSIZE 32
__global__ void kernel(){

  int *a = new int[DSIZE];
  for (int i = 0; i < DSIZE; i++) a[i] = i;
  for (int i = 0; i < DSIZE; i++) printf("%d ", a[i]);
  printf("\n");
}

int main(){

  kernel<<<1,1>>>();
  cudaDeviceSynchronize();
}
$ nvcc -o t716 t716.cu
$ cuda-memcheck ./t716
========= CUDA-MEMCHECK
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31
========= ERROR SUMMARY: 0 errors
$

And as you point out, just like usage of device malloc, device new requires a cc2.0 device or higher.

fastinvsqrt · April 13, 2015, 3:10pm

I did try it, and I was getting those errors :/ I overloaded the new and delete operators for the classes I needed to allocate on the device, though, and the errors went away.

Topic		Replies	Views
New operator in device functions CUDA Programming and Performance	1	2501	May 21, 2017
Using cudamalloc in device function CUDA Programming and Performance	4	7817	September 27, 2013
How to use malloc and free inside the device version 3.2 using malloc and free inside the device ker CUDA Programming and Performance	6	23987	October 23, 2010
Overloading new[] doesn't work in device code CUDA Programming and Performance	4	8761	December 22, 2010
malloc-ing in either host or device code CUDA Programming and Performance	2	2252	February 5, 2009
Problems calling new operator in CUDA 4.0 CUDA Programming and Performance	4	11416	May 18, 2013
Segmentation fault when calling virtual function on host CUDA Programming and Performance	9	2481	September 10, 2019
may i malloc mem on device. CUDA Programming and Performance	2	2079	December 30, 2008
Dynamic memory allocation during kernel execution Is it posible? CUDA Programming and Performance	13	169401	January 25, 2013
Not working correctly new () and malloc () inside the kernel, why? CUDA Programming and Performance	2	1252	April 4, 2014

Can I use "operator new" in device code?

Related topics