Is it safe to use CUBIN objects created with CUDA 4.0 on a system with CUDA 4.1?

ftwinter · May 7, 2012, 10:01am

My application uses CUDA kernels for the bulk of its computations. For good reasons (outside the scope of this question), I use a shared-object linking model and dynamically load the object files, each of which contains one host function and one CUDA kernel. Since kernels cannot be declared extern, the basic structure of such a “kernel” is:

__global__ void kernel() { ...code... }

extern "C" void call_kernel() {

  kernel<<<GRID,BLOCK,SHMEM>>>();

}

I use a host function whose sole purpose is to call the kernel. To build the shared object I use:

nvcc -arch=sm_20 -m64 --compiler-options -fPIC,-shared -link -o kernel0.o kernel0.cu

The whole app uses lots of these kernels and they are loaded with dlopen(). The whole thing works fine if everything (building/loading/executing) stays on one machine A.
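For readers unfamiliar with this loading model, here is a minimal host-side sketch of what such a dlopen() loader might look like. It is not the poster's actual code: the helper name `run_kernel_from` is hypothetical, and it assumes each shared object exports the `call_kernel` wrapper shown above with a `void()` signature. On some systems the program must be linked with `-ldl`.

```cpp
#include <dlfcn.h>
#include <cstdio>

// Assumed signature of the extern "C" wrapper exported by each shared object.
using kernel_entry_t = void (*)();

// Hypothetical helper: loads one shared object, looks up its wrapper, calls it.
// Returns false (after printing the dlerror() text) if loading or lookup fails.
bool run_kernel_from(const char* so_path, const char* symbol = "call_kernel") {
    void* handle = dlopen(so_path, RTLD_NOW | RTLD_LOCAL);
    if (handle == nullptr) {
        std::fprintf(stderr, "dlopen failed: %s\n", dlerror());
        return false;
    }

    dlerror();  // clear any stale error state before calling dlsym
    void* sym = dlsym(handle, symbol);
    const char* err = dlerror();
    if (err != nullptr || sym == nullptr) {
        std::fprintf(stderr, "dlsym failed: %s\n", err ? err : "symbol is null");
        dlclose(handle);
        return false;
    }

    // Call the host wrapper, which in turn launches the CUDA kernel.
    reinterpret_cast<kernel_entry_t>(sym)();

    // The handle is deliberately left open, on the assumption that the
    // kernel will be called again during the lifetime of the process.
    return true;
}
```

With the build command above, a call would look like `run_kernel_from("./kernel0.o")`. The explicit `./` matters because dlopen() searches the library path rather than the current directory when the name contains no slash.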

But when I build the shared objects on, say, machine B (CUDA 4.1, NVIDIA C2050) and dlopen them later on machine A (CUDA 4.0, GTX 480), the computation does not yield the same result as when the shared objects are also built on machine A.

That sounds odd to me. Isn’t there a CUBIN object embedded in the .o file which contains instructions that are independent of the particular GPU architecture?

I know that it is advised to use the same compiler version for building and linking. Again, I have good reasons not to build the shared objects on the machine where they are executed.

Cross listed

Gilles_C · May 7, 2012, 11:02am

Hi,
I can't say much about compatibility in your case, but I wanted to draw your attention to the fact that, starting with CUDA 4.1, the GPU compiler for compute capability 2.0 and above is based on LLVM. Your differences in results may come from this change of compiler rather than from mixing objects from different versions. I would encourage you to test your whole application in the 4.1 environment and compare the results with the ones you get from the 4.0 environment.
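One way to make that comparison concrete is to measure how far the two sets of results are apart rather than checking for exact equality, since different compilers are not guaranteed to produce bit-identical floating-point output. The sketch below assumes the results have been copied back to the host as arrays of doubles; the function name `max_relative_diff` is hypothetical, and the acceptable threshold depends on the application.

```cpp
#include <algorithm>
#include <cmath>
#include <cstddef>
#include <limits>
#include <vector>

// Hypothetical helper: largest relative difference between two result arrays.
// Returns infinity if the lengths differ, or if a pair of elements cannot be
// compared meaningfully (a NaN, or an infinity that the other side lacks).
double max_relative_diff(const std::vector<double>& a,
                         const std::vector<double>& b) {
    const double inf = std::numeric_limits<double>::infinity();
    if (a.size() != b.size()) return inf;

    double worst = 0.0;
    for (std::size_t i = 0; i < a.size(); ++i) {
        if (a[i] == b[i]) continue;  // identical, including matching zeros or infinities
        if (!std::isfinite(a[i]) || !std::isfinite(b[i])) return inf;
        // Scale by the larger magnitude; it is nonzero because a[i] != b[i].
        const double scale = std::max(std::fabs(a[i]), std::fabs(b[i]));
        worst = std::max(worst, std::fabs(a[i] - b[i]) / scale);
    }
    return worst;
}
```

A value near machine precision would suggest ordinary rounding differences between the two compilers, while a large value would point to a genuine problem in the build or the loading step.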
