Another double precision issue Find out for which architecture a kernel has been compiled

santner · November 20, 2008, 8:55am

Hello,

I am trying to build functions using single and double precision, where the user can choose which precision he’d like to use. This of course is limited by the different compile modes (-arch sm_10 / -arch sm_13). What I can state now is the following:

[*]I can compile double precision kernels by using the -arch sm_13 flag. These functions run well on my 280GTX

[*]If compiled with -arch sm_13, all kernels (even those not using double) fail when using any older card.

What I’d like to have is a code switch like #ifdef sm_13, which allows for detecting which compilation architecture has been used.

Any clues?

Simon_Green · November 20, 2008, 10:40am

You need to compile separate versions of the kernel for sm_10 and sm_13 and then choose between them at runtime based on the compute capability of the card (which you can get from cudaGetDeviceProperties).

The Mandelbrot sample in the SDK shows how to do this.

santner · November 20, 2008, 11:58am

Thank you for your response.

That way is even more convenient except the additional work needed to compile two versions…

Topic		Replies	Views
Compile time architecture checking? CUDA Programming and Performance	1	1068	January 4, 2011
How to activate double-precision computation CUDA Programming and Performance	4	30405	September 14, 2009
Using double precision in CUDA how to turn on double precision in CUDA CUDA Programming and Performance	2	3074	July 27, 2008
Problem with running code with double precision values Double precision gives wrong result CUDA Programming and Performance	2	1240	August 28, 2009
double doesnt work in kernel CUDA Programming and Performance	7	3752	October 23, 2008
Wrong results for double precision calculations Not setting arch=sm_13 causes incorrect results (onl CUDA Programming and Performance	1	10230	October 26, 2010
-arch sm_13 business CUDA Programming and Performance	2	558	March 30, 2019
Compute 1.3 and invalid device function CUDA Programming and Performance	2	3150	January 30, 2009
How To Define Architecture in Makefile CUDA Programming and Performance	1	22122	April 12, 2011
enable double precision for SDK I can't figure out where in the makefile the -arch sm_13 should CUDA Programming and Performance	8	8113	November 16, 2009

Another double precision issue Find out for which architecture a kernel has been compiled

Related topics