__CUDA_ARCH__ undefined?!

njuffa · April 6, 2012, 6:21pm

As far as I understand the compilation process, tera’s explanation is right on the money. As an addendum, one reason CUDA_ARCH is undefined in host code is because for fatbinary compilation targeting multiple device architectures, host code is only compiled once, so it can’t be associated with any particular CUDA architecture.

The recommended way to check for the CUDA architecture in device code is something like this:

#if defined(__CUDA_ARCH__) && (__CUDA_ARCH__ >= 200)

In general CUDA architecture versions follow an onion-layer model, so the use of architectural features is usually best guarded by >= comparisons against CUDA_ARCH.

Topic		Replies	Views
__CUDA_ARCH__ undefined by NVCC on CUDA 3.2 RC CUDA Programming and Performance	15	3875	November 26, 2010
Is __CUDA_ARCH__ broken? CUDA Programming and Performance	3	12854	June 10, 2011
CUDA and nvcc: using the preprocessor to choose between float or double CUDA Programming and Performance	2	4305	January 10, 2012
[CUDA 4.0] : __CUDA_ARCH__ undefined in device code CUDA Programming and Performance	9	6844	July 14, 2011
CUDA architecture Macro CUDA Programming and Performance	2	1941	April 27, 2012
Use __CUDA_ARCH__ outside device function CUDA NVCC Compiler	0	372	November 18, 2022
Strange bug with __CUDA_ARCH__ and kernel template implicit instantiation CUDA Developer Tools	0	607	June 18, 2021
__CUDA_ARCH__ is not defined Jetson AGX Orin cuda	2	665	May 30, 2022
About Interval SDK Example How to compile for SM 2.0 CUDA Programming and Performance	1	6002	December 3, 2010
Fermi Flag CUDA Programming and Performance	8	8425	June 8, 2010

__CUDA_ARCH__ undefined?!

Related topics