CUDA 2.1 problems with make,OpenGL and others

I compiled CUDA SDK 2.0 with 2.1 Toolkit, there is no problem running make with no options but make dbg=1 or emu=1 fails.

Device 0 is GeForce 8800GTS 512 @PCIe x4

Device 1 is Tesla c870 @ PCIe x16

System is Fedora 9 x64, gcc 4.3.0(yes, I added the missing headers in SDK), nvidia driver 180.06

Output of make dbg=1

...

make -C projects/reduction/ 

make[1]: Entering directory `/home/t0bi/NVIDIA_CUDA_SDK/projects/reduction'

reduction_kernel.cu: In function ‘void reduce_sm10(int, int, int, int, T*, T*)’:

reduction_kernel.cu:337: error: ‘reduce0_sm10’ was not declared in this scope

reduction_kernel.cu:340: error: ‘reduce1_sm10’ was not declared in this scope

reduction_kernel.cu:343: error: ‘reduce2_sm10’ was not declared in this scope

reduction_kernel.cu:346: error: ‘reduce3_sm10’ was not declared in this scope

make[1]: *** [obj/debug/reduction_kernel_sm10.cu_o] Error 255

make[1]: Leaving directory `/home/t0bi/NVIDIA_CUDA_SDK/projects/reduction'

make: *** [projects/reduction/Makefile.ph_build] Error 2

Output of make emu=1

...

make -C projects/matrixMulDrv/ 

make[1]: Entering directory `/home/t0bi/NVIDIA_CUDA_SDK/projects/matrixMulDrv'

nvcc fatal   : Emulation mode is not compatible with the phase specified

make[1]: *** [data/matrixMul_kernel.cubin] Error 255

make[1]: Leaving directory `/home/t0bi/NVIDIA_CUDA_SDK/projects/matrixMulDrv'

make: *** [projects/matrixMulDrv/Makefile.ph_build] Error 2

Next problem is OpenGL with Tesla c870. When I try to run app from SDK on c870 that use OpenGL, seems to freeze(nbody, particles) or raises segmentation fault (simpleGL) or prints in console (fluidsGL) :

$ ./fluidsGL -device=1

Using device 1: Tesla C870

cufft: ERROR: /root/cuda-stuff/sw/rel/gpgpu/toolkit/r2.1/cufft/src/execute.cu, line 1070

cufft: ERROR: CUFFT_EXEC_FAILED

cufft: ERROR: /root/cuda-stuff/sw/rel/gpgpu/toolkit/r2.1/cufft/src/execute.cu, line 316

cufft: ERROR: CUFFT_EXEC_FAILED

cufft: ERROR: /root/cuda-stuff/sw/rel/gpgpu/toolkit/r2.1/cufft/src/cufft.cu, line 151

cufft: ERROR: CUFFT_EXEC_FAILED

cufft: ERROR: /root/cuda-stuff/sw/rel/gpgpu/toolkit/r2.1/cufft/src/execute.cu, line 1070

cufft: ERROR: CUFFT_EXEC_FAILED

cufft: ERROR: /root/cuda-stuff/sw/rel/gpgpu/toolkit/r2.1/cufft/src/execute.cu, line 316

cufft: ERROR: CUFFT_EXEC_FAILED

cufft: ERROR: /root/cuda-stuff/sw/rel/gpgpu/toolkit/r2.1/cufft/src/cufft.cu, line 151

cufft: ERROR: CUFFT_EXEC_FAILED

cufft: ERROR: /root/cuda-stuff/sw/rel/gpgpu/toolkit/r2.1/cufft/src/execute.cu, line 1070

cufft: ERROR: CUFFT_EXEC_FAILED

cufft: ERROR: /root/cuda-stuff/sw/rel/gpgpu/toolkit/r2.1/cufft/src/execute.cu, line 316

cufft: ERROR: CUFFT_EXEC_FAILED

cufft: ERROR: /root/cuda-stuff/sw/rel/gpgpu/toolkit/r2.1/cufft/src/cufft.cu, line 151

cufft: ERROR: CUFFT_EXEC_FAILED

cufft: ERROR: /root/cuda-stuff/sw/rel/gpgpu/toolkit/r2.1/cufft/src/execute.cu, line 1070

cufft: ERROR: CUFFT_EXEC_FAILED

cufft: ERROR: /root/cuda-stuff/sw/rel/gpgpu/toolkit/r2.1/cufft/src/execute.cu, line 316

cufft: ERROR: CUFFT_EXEC_FAILED

cufft: ERROR: /root/cuda-stuff/sw/rel/gpgpu/toolkit/r2.1/cufft/src/cufft.cu, line 151

cufft: ERROR: CUFFT_EXEC_FAILED

There is no problem running on Device 0 (Geforce 8800 GTS 512), and even runs very,very fast comparing to 2.0. See the attached file, but don’t belive the fps and GFLOPS ;-) but e g nbody has about 60 fps instead 30 I used to have on 8800GTS. Particles demo runs about 200 fps, after increasing iterations count to 10 fps drops to 40. Does new driver really give so big speedup ?

When will be SDK 2.1 beta released ?
cuda2.1_in_action.jpg