CUDA 10 & VS2017 & C++17

If I move my VS 2017 C++ project over to C/C++ > Language > ISO C++17, the .cu files no longer compile; nvcc fails with the message "nvcc fatal : Compiler 'cl.exe' in PATH different than the one specified with -ccbin".

The -ccbin portion of the command line is -ccbin "C:\Program Files (x86)\Microsoft Visual Studio\2017\Enterprise\VC\Tools\MSVC\14.15.26726\bin\HostX86\x64", but I'm pretty sure the cl.exe found on the PATH is a different one when compiling for C++17.

Help.

CUDA 10.0 supports C++14 but not C++17.

https://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html#c-cplusplus-language-support

If you need C++17 host code, put that in a .cpp file. And you may need to put it in a separate project.
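
For example, a minimal sketch of that split (the file names and the pick_larger helper are placeholders I'm inventing here): the C++17 parts live in a .cpp translation unit that cl.exe compiles with /std:c++17, while the .cu file only includes a plain C++14-compatible declaration.

// host_utils.h -- shared header with C++14-compatible declarations only
#pragma once
int pick_larger(int a, int b);

// host_utils.cpp -- compiled by cl.exe with /std:c++17
#include "host_utils.h"
#include <algorithm>
#include <type_traits>

int pick_larger(int a, int b)
{
    // if constexpr and std::is_integral_v are C++17; nvcc never sees this file
    if constexpr (std::is_integral_v<int>)
        return std::max(a, b);
    else
        return a > b ? a : b;
}

// kernel.cu -- compiled by nvcc with -std=c++14
#include "host_utils.h"

__global__ void fill(int* out, int value)
{
    out[threadIdx.x] = value;
}

void launch(int* out, int a, int b)
{
    fill<<<1, 32>>>(out, pick_larger(a, b));  // host-side call into the C++17 unit
}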

If you need C++17 device code, you can compile CUDA with clang. However, I don't believe CUDA 10 is supported even by the most recent clang releases. I wanted to try some C++17 in device code, and I got CUDA 9.0 working with clang 6 (9.2 didn't work; I didn't try 9.1, but from internet searches I don't believe it works either). I haven't tried clang 7 (which supports compiling relocatable device code, though I don't know which CUDA versions it supports), and I don't think clang 8 has been released yet.

There are some small differences between clang CUDA and nvcc CUDA (some for the worse, some for the better). I had to make a couple of minor changes to my project, but otherwise it should "just work" for the most part. The clang-compiled code seems oddly slower, for reasons unknown to me, but since this is just for experimentation for now, that doesn't bother me.

https://llvm.org/docs/CompileCudaWithLLVM.html
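
For orientation, the kind of invocation that document describes looks roughly like this (a sketch only: the CUDA path, the sm_60 architecture, and the axpy.cu example are placeholders to adjust for your own setup, and the exact flags vary between clang and CUDA versions).

// Build sketch, per the LLVM doc above (Linux paths shown; adjust as needed):
//   clang++ -std=c++17 axpy.cu -o axpy --cuda-gpu-arch=sm_60 \
//       --cuda-path=/usr/local/cuda-9.0 \
//       -L/usr/local/cuda-9.0/lib64 -lcudart -ldl -lrt -pthread

#include <cstdio>
#include <cuda_runtime.h>

__global__ void axpy(float a, const float* x, float* y)
{
    int i = threadIdx.x;
    y[i] = a * x[i] + y[i];
}

int main()
{
    float x[4] = {1, 2, 3, 4}, y[4] = {10, 10, 10, 10};
    float *dx, *dy;
    cudaMalloc(&dx, sizeof(x));
    cudaMalloc(&dy, sizeof(y));
    cudaMemcpy(dx, x, sizeof(x), cudaMemcpyHostToDevice);
    cudaMemcpy(dy, y, sizeof(y), cudaMemcpyHostToDevice);
    axpy<<<1, 4>>>(2.0f, dx, dy);
    cudaMemcpy(y, dy, sizeof(y), cudaMemcpyDeviceToHost);
    printf("%g %g %g %g\n", y[0], y[1], y[2], y[3]);   // expect 12 14 16 18
    cudaFree(dx);
    cudaFree(dy);
    return 0;
}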

One example of a change I had to make involves calling functions like max/min from host code. With nvcc you can use the global namespace, i.e. call max(a,b) or ::max(a,b). With clang you have to call std::max(a,b) in host code, which means that for host/device code you either overload the function separately as __host__ and __device__ (which you can do in clang, but not nvcc), e.g.:

__host__ void function1() { ... std::max(a,b) ... }

__device__ void function1() { ... ::max(a,b) ... }

or, inside a single function, you detect whether you are compiling for device vs. host (and clang vs. nvcc, if you wish) and write different code, or wrap that detection in a macro, which is what these developers did:

http://eigen.tuxfamily.org/bz_attachmentbase/attachment.cgi?id=671

A simple version below:

// detect when clang CUDA is compiling host code
#if defined(__clang__) && defined(__CUDA__) && !defined(__CUDA_ARCH__)
#define USE_STD_NAMESPACE 1
#endif

#if defined(USE_STD_NAMESPACE)
#define __STD__ std
#else
#define __STD__
#endif

__host__ __device__ void function1() { ... __STD__::max(a,b) ... }

I wrote that quickly, but I think it should work while keeping the code kosher for nvcc as well.
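
To make that concrete, here is a self-contained version of the same trick (a sketch only: I've renamed the macro to MAXMIN_NS to avoid the reserved double-underscore name, clamp_to is an invented example, and I've only reasoned it through against the nvcc/clang behaviour described above).

#include <algorithm>        // std::max/std::min for the clang host path
#include <cstdio>
#include <cuda_runtime.h>

// When clang compiles the host side of a CUDA file, plain ::max/::min are not
// available, so route through std:: there; everywhere else (nvcc host, nvcc
// device, clang device) use the global-namespace versions.
#if defined(__clang__) && defined(__CUDA__) && !defined(__CUDA_ARCH__)
#define MAXMIN_NS std
#else
#define MAXMIN_NS
#endif

__host__ __device__ int clamp_to(int value, int lo, int hi)
{
    return MAXMIN_NS::min(MAXMIN_NS::max(value, lo), hi);
}

__global__ void clamp_kernel(int* data, int lo, int hi)
{
    int i = threadIdx.x;
    data[i] = clamp_to(data[i], lo, hi);
}

int main()
{
    int h[4] = {-5, 3, 42, 7};
    int* d;
    cudaMalloc(&d, sizeof(h));
    cudaMemcpy(d, h, sizeof(h), cudaMemcpyHostToDevice);
    clamp_kernel<<<1, 4>>>(d, 0, 10);
    cudaMemcpy(h, d, sizeof(h), cudaMemcpyDeviceToHost);
    printf("%d %d %d %d\n", h[0], h[1], h[2], h[3]);   // expect 0 3 10 7
    cudaFree(d);
    return 0;
}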

EDIT: I realize you are using Visual Studio. You can get clang working with Visual Studio, though it is a bit tricky. So I guess this route is only if you really want C++17 device code. :)

Hi, when you use clang for a CUDA project, do you manage to use cuda-gdb to debug code inside a CUDA kernel? Thanks!

Great question! I don’t know. :) I’ve just been playing around with clang-CUDA for fun.

It’s also entirely possible that the standard debuggers (GDB/LLDB) that work with clang would simply work. But I haven’t tried.

Hi, a year later, how is C++17 compatibility with CUDA? Is there a roadmap for supporting the latest C++ standard? constexpr if, the new type traits, and new function templates like std::invoke would be useful.
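
Just to illustrate the kind of thing I mean, here is a sketch of device code that C++17 would enable (it needs a toolchain that accepts C++17 in device code, such as the clang route discussed above; sum_bytes is just an invented example).

#include <cstdio>
#include <cuda_runtime.h>
#include <type_traits>

// With "if constexpr", the non-matching branch is discarded at compile time,
// so each instantiation only compiles the code that is valid for T.
template <typename T>
__device__ int sum_bytes(const T& value)
{
    if constexpr (std::is_integral_v<T>) {
        int sum = 0;
        for (unsigned i = 0; i < sizeof(T); ++i)
            sum += (value >> (8 * i)) & 0xFF;
        return sum;
    } else {
        return static_cast<int>(sizeof(T));   // fallback for non-integral types
    }
}

__global__ void demo(int* out, float f, unsigned u)
{
    out[0] = sum_bytes(u);   // integral branch
    out[1] = sum_bytes(f);   // non-integral branch; the shift on float is never compiled
}

int main()
{
    int* d;
    cudaMalloc(&d, 2 * sizeof(int));
    demo<<<1, 1>>>(d, 3.5f, 0x01020304u);
    int h[2];
    cudaMemcpy(h, d, sizeof(h), cudaMemcpyDeviceToHost);
    printf("%d %d\n", h[0], h[1]);   // expect 10 (1+2+3+4) and 4 (sizeof(float))
    cudaFree(d);
    return 0;
}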


Hi, I'm bringing this thread up again. Is there any news about C++17 support in CUDA? Any roadmap? It would be interesting to use if constexpr and the other language improvements.


I've found a few machines where nvcc fails to compile the CUDA examples with
nvcc fatal : Compiler 'cl.exe' in PATH different than the one specified with -ccbin
or simply with errorlevel 1.

The reason was an Anaconda setting in
HKEY_CURRENT_USER\Software\Microsoft\Command Processor\AutoRun

see

for explanation