Getting the class size of non POD types for the device

kmorel · June 12, 2019, 3:23pm

I am trying to get the size of C++ classes with virtual methods so that I can use cudaMalloc to allocate space for the object on a device and call a placement new on the device. The question I have is how do I get the correct size of the object in bytes to pass to cudaMalloc?

The standard answer is to simply use sizeof(classname), which is expected to give the same size on both the host and device, and this is what we usually do. However, after encountering some errors on Windows, I came across an exception to this rule in the “Windows-Specific” section of the Classes documentation of the CUDA Programming Guide (Programming Guide :: CUDA Toolkit Documentation). According to this documentation, “The CUDA compiler follows the IA64 ABI for class layout, while the Microsoft host compiler does not.” Most importantly, “The CUDA compiler may compute the class layout and size differently than the Microsoft host compiler for [certain types].” I refer you to the guide for exactly which types fall into this exception, but classes with virtual methods and multiple inheritance qualify.

So, for cases where it is possible for the size of a class to differ on the host and device, how does the host code get the size of the device version of the class?

Robert_Crovella · June 12, 2019, 6:35pm

You could launch a kernel that evaluates sizeof(classname) in device code and returns the result to the host.
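A minimal sketch of that suggestion follows. It is not code from the thread: the names sizeofKernel and deviceSizeof and the example classes ShapeBase, Tagged, and Circle are hypothetical, chosen only to show a type with virtual methods and multiple inheritance. It assumes that sizeof evaluated inside a kernel reflects the device compiler's layout.

```cuda
#include <cstddef>
#include <cuda_runtime.h>

// Hypothetical example types: virtual methods plus multiple inheritance,
// the kind of class whose host and device layouts may differ on Windows.
struct ShapeBase {
  __host__ __device__ virtual float area() const { return 0.0f; }
  float scale;
};

struct Tagged {
  __host__ __device__ virtual int tag() const { return 0; }
  int id;
};

struct Circle : ShapeBase, Tagged {
  __host__ __device__ float area() const override {
    return 3.14159265f * radius * radius;
  }
  __host__ __device__ int tag() const override { return 1; }
  float radius;
};

// Evaluates sizeof(T) in device code and writes it to device memory.
template <typename T>
__global__ void sizeofKernel(std::size_t* out) {
  *out = sizeof(T);
}

// Host helper: launches the kernel and copies the device-side size back.
// Returns the first CUDA error encountered, or cudaSuccess.
template <typename T>
cudaError_t deviceSizeof(std::size_t* result) {
  std::size_t* dSize = nullptr;
  cudaError_t err = cudaMalloc(&dSize, sizeof(std::size_t));
  if (err != cudaSuccess) return err;

  sizeofKernel<T><<<1, 1>>>(dSize);
  err = cudaGetLastError();  // catches launch failures
  if (err == cudaSuccess) {
    // cudaMemcpy blocks until the kernel has finished.
    err = cudaMemcpy(result, dSize, sizeof(std::size_t),
                     cudaMemcpyDeviceToHost);
  }
  cudaFree(dSize);
  return err;
}
```

The host would then call deviceSizeof<Circle>(&bytes) once, pass bytes to cudaMalloc instead of the host-side sizeof(Circle), and construct the object with placement new inside a kernel. Caching the result per type avoids repeating the launch.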
