OOP Class Design with Device Variables

jay333 · August 5, 2021, 11:17pm

I’m writing a class with pointer variables to device memory. The class is a copy of an existing class on the host, except it runs the main computational function on the GPU. It looks something like this:

class GPUClass {
public:
   GPUClass() {
      // allocate the device buffers
      cudaMalloc(&device_ptr1, ...);
      cudaMalloc(&device_ptr2, ...);
      ...
   }

   void GPUComputeFunction(...) {...}

   ~GPUClass() {
      // release the device buffers
      cudaFree(device_ptr1);
      cudaFree(device_ptr2);
      ...
   }

private:
   void* device_ptr1;
   void* device_ptr2;
   ...
};

Here, I allocate the device memory in the class constructor and de-allocate it in the destructor. I’m wondering if there are any pitfalls to this approach, or if there are better ways to do it?

Robert_Crovella · August 5, 2021, 11:26pm

If you pass an object of this class to a device kernel by value, C++ pass-by-value semantics make a copy of the object. At the completion of the function (i.e. kernel) call, the copy's destructor is called, and that destructor calls cudaFree on the same device pointers the original object still holds. Think about the implications of that carefully: the original is left with dangling pointers, and its own destructor will later try to free them a second time.
If you have an object of this class declared at global scope, the constructor and destructor can run outside of main, that is, before main begins and after it returns. This is frowned on, and as your application is quitting you may get an error returned from the CUDA calls in the destructor (visible, for example, if you run under compute-sanitizer).
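
To make the first point concrete, here is a minimal sketch, assuming a stripped-down class with a single buffer. Owner, byValue, and byPointer are hypothetical names that do not appear in the original post, and the 256-element size is arbitrary. The destructor prints the status of each cudaFree so you can see how many times it runs.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical stand-in for the class in the question: owns one device buffer
// and frees it in the destructor.
struct Owner {
    float* d_data = nullptr;

    Owner() { cudaMalloc(&d_data, 256 * sizeof(float)); }

    ~Owner() {
        cudaError_t err = cudaFree(d_data);
        std::printf("~Owner: cudaFree -> %s\n", cudaGetErrorString(err));
    }
};

// Pass-by-value: the launch copies the Owner object, pointer included.
__global__ void byValue(Owner o) { o.d_data[threadIdx.x] = 1.0f; }

// Alternative: hand the kernel only the raw device pointer.
__global__ void byPointer(float* data) { data[threadIdx.x] = 2.0f; }

int main() {
    {
        Owner a;
        // The launch makes at least one host-side copy of `a`; each copy's
        // destructor calls cudaFree on a.d_data.
        byValue<<<1, 256>>>(a);
        cudaDeviceSynchronize();
    }   // a's own destructor now frees a pointer that a copy already freed

    {
        Owner b;
        // No Owner copy is made, so the buffer is freed once, by b itself.
        byPointer<<<1, 256>>>(b.d_data);
        cudaDeviceSynchronize();
    }
    return 0;
}
```

Handing the kernel the raw pointer, as in the second block, avoids copying the owning object at all. Another option, again only a suggestion, is to delete the copy constructor and copy assignment operator so that an accidental pass-by-value fails to compile.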

Both of these issues have bitten people, and you can find questions about them on various forums.

So putting CUDA calls in the destructor is often troublesome. In short, don’t do that. One possible approach is to give the class explicit initialize/deinit methods that you call manually.
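
A minimal sketch of the manual initialize/deinit approach, assuming two buffers as in the question. The method names init and deinit, their parameters, and the byte counts in main are my own choices for illustration, not anything from the original class.

```cuda
#include <cstddef>
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical rework of GPUClass with no CUDA calls in the constructor
// or destructor.
class GPUClass {
public:
    GPUClass() = default;

    // Allocate the device buffers. Call this after main() has started.
    cudaError_t init(std::size_t bytes1, std::size_t bytes2) {
        cudaError_t err = cudaMalloc(&device_ptr1, bytes1);
        if (err != cudaSuccess) return err;
        err = cudaMalloc(&device_ptr2, bytes2);
        if (err != cudaSuccess) {
            // Roll back the first allocation if the second one fails.
            cudaFree(device_ptr1);
            device_ptr1 = nullptr;
        }
        return err;
    }

    // Release the buffers. Call this before main() returns.
    // Safe to call twice: cudaFree on a null pointer performs no operation.
    cudaError_t deinit() {
        cudaError_t err1 = cudaFree(device_ptr1);
        cudaError_t err2 = cudaFree(device_ptr2);
        device_ptr1 = nullptr;
        device_ptr2 = nullptr;
        return err1 != cudaSuccess ? err1 : err2;
    }

private:
    void* device_ptr1 = nullptr;
    void* device_ptr2 = nullptr;
};

int main() {
    GPUClass g;
    cudaError_t err = g.init(1024, 2048);  // arbitrary sizes for the sketch
    if (err != cudaSuccess) {
        std::printf("init failed: %s\n", cudaGetErrorString(err));
        return 1;
    }

    // ... kernels would be launched here with the raw device pointers ...

    err = g.deinit();
    if (err != cudaSuccess) {
        std::printf("deinit failed: %s\n", cudaGetErrorString(err));
        return 1;
    }
    return 0;
}
```

Because the destructor no longer frees anything, a copy of the object going out of scope leaves the buffers alone. The cost is that you have to remember to call deinit yourself.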

If you want to follow a high-quality C++ approach, you might wish to study how thrust does it, although it's not for the faint of heart, or just use thrust.
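
As a rough illustration of the "just use thrust" option, here is a sketch in which a thrust::device_vector owns the buffer instead of a raw pointer. The scale kernel, the first() accessor, the float element type, and the sizes are all assumptions made for the example.

```cuda
#include <cstddef>
#include <cstdio>
#include <cuda_runtime.h>
#include <thrust/device_vector.h>

// Hypothetical kernel: multiply every element by a factor.
__global__ void scale(float* data, std::size_t n, float factor) {
    std::size_t i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) data[i] *= factor;
}

class GPUClass {
public:
    // device_vector allocates n floats on the device and fills them with 1.0f.
    explicit GPUClass(std::size_t n) : buf_(n, 1.0f) {}

    void GPUComputeFunction(float factor) {
        // Kernels take a raw pointer, never the device_vector itself.
        float* raw = thrust::raw_pointer_cast(buf_.data());
        unsigned int blocks = static_cast<unsigned int>((buf_.size() + 255) / 256);
        scale<<<blocks, 256>>>(raw, buf_.size(), factor);
        cudaDeviceSynchronize();
    }

    // Reading one element copies it from device to host.
    float first() const { return buf_[0]; }

private:
    thrust::device_vector<float> buf_;  // frees its storage in its own destructor
};

int main() {
    GPUClass g(1000);
    g.GPUComputeFunction(3.0f);
    std::printf("first element: %f\n", g.first());
    return 0;
}
```

Copying such an object performs a deep copy of the vector, so the double-free from pass-by-value does not arise on the host side. The global-scope caveat above still applies, though, since device_vector releases device memory in its destructor.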
