Std::visit a std::variant

o.o · March 31, 2021, 5:43pm

I wonder if one can visit a variant in device code, since c++17 is supposedly supported in CUDA.

Consider this example (with cuda 11.1):

#include <variant>
#include <cstdio>

struct A{};
struct B{};
struct Visitor {
  __device__
  void operator()(const A&) { printf("It's an A!\n"); }
  __device__
  void operator()(const B&) { printf("It's a  B!\n"); }
};

__global__
void visitVariant() {
  std::variant<A, B> var{A{}};
  std::visit(Visitor{}, var);
}

int main() {
  visitVariant<<<1, 1>>>();
  cudaDeviceSynchronize();
  return 0;
}

First of all, it doesn’t compile unless we use --expt-relaxed-constexpr. Fine - let’s do that.
The problem, however, is the fact that the compiler is unable to put device functions in the jump table of std::visit.
You can make it compile if you replace as follows

- __device__
+ __host__ __device__
  void operator()(const A&) { printf("It's an A!\n"); }

but then the compiler places /*__host__*/ void operator() in the jump table, which leads to a crash.

It seems that std::visit is unusable in device code. Is this supposed to work?

Note: Manual jump tables work just fine

  __global__
  void visitVariant() {
    std::variant<A, B> var{A{}};
-   std::visit(Visitor{}, var);
+   if (std::holds_alternative<A>(var)) {
+     Visitor()(std::get<A>(var));
+   } else if (std::holds_alternative<B>(var)) {
+     Visitor()(std::get<B>(var));
+   }
}

Output:

It's an A!

njuffa · March 31, 2021, 6:02pm

The CUDA documentation states that C++17 is supported with restrictions. I am not knowledgeable about C++17 (yet), but before you file a bug with NVIDIA, check the docs:

G.4.18. C++17 Features

o.o · March 31, 2021, 6:05pm

Thanks for the link!

I had checked that part already (it’s directly reachable from the part of the documentation I linked at the beginning of my first post). It seems to me that the two restrictions listed there don’t apply to the problem at hand.

Let’s see if somebody has an idea.

Robert_Crovella · March 31, 2021, 7:27pm

Topic		Replies	Views
questions about CUDA 3.1 CUDA Programming and Performance	2	3257	July 14, 2010
Some easy, but useful questions CUDA Programming and Performance	6	4155	July 11, 2008
Force nested inlining to avoid redundant function calls CUDA Programming and Performance cuda , kernel	3	478	December 8, 2023
Can Cuda work with c#? CUDA Programming and Performance	7	58389	June 12, 2007
How to make device member function directly access class member? CUDA Programming and Performance	5	633	June 17, 2023
cuda function pointers to class member functions (change in Pascal?) CUDA Programming and Performance	5	1177	June 2, 2019
Small question about function call CUDA Programming and Performance cuda	4	350	April 8, 2020
"Selective usage" of __device__ in template class CUDA Programming and Performance	0	380	February 1, 2021
Struct calling a __host__ function from a __device__ function is not allowed or a crash CUDA Programming and Performance	7	7016	October 16, 2015
__host__ and __device__ qualifies CUDA Programming and Performance	1	4407	February 13, 2010

Std::visit a std::variant

Note: Manual jump tables work just fine

G.4.18. C++17 Features

Related topics