Cannot run SDK samples ('kernel execution failed')

Spaceknarf · December 5, 2008, 11:54pm

I’m having problems getting the SDK 2.1 samples to compile and run on Fedora 10.

My setup:

Hardware:

Intel D915GAG mobo

Pentium 4 2.8GHz

3GB RAM (DDR400, not running in dualchannel mode)

GeForce 6200 (which of course doesn’t support CUDA, but I’ll list it anyway for completeness)

Tesla C870

Software:

Fedora 10

gcc 4.3

Nvidia driver 177.82 installed from the RPMFusion repository

Toolkit ‘cuda-linux-rel-nightly-2.1.1635-3065709’

SDK ‘cuda-sdk-linux-2.10.1126.1520-3141441’

packages freeglut and freeglut-devel 2.4

My compiling problems:

The projects threadMigration, matrixMulDrv and simpleTextureDrv will not compile. I get this error:

make[1]: Entering directory `/home/frank/NVIDIA_CUDA_SDK/projects/simpleTextureDrv'

/usr/bin/ld: cannot find -lcuda

collect2: ld returned 1 exit status

make[1]: *** [../../bin/linux/release/simpleTextureDrv] Error 1

make[1]: Leaving directory `/home/frank/NVIDIA_CUDA_SDK/projects/simpleTextureDrv'

make: *** [projects/simpleTextureDrv/Makefile.ph_build] Error 2

I have looked at this topic: error: cannot find -lcuda, but it doesn’t solve my problem. I think I have all files, links and paths correct, as shown by the following printouts:

[frank@localhost ~]$ ls /usr/local/cuda/lib

libcublasemu.so	  libcublas.so.2	libcudart.so.2.1	libcufft.so

libcublasemu.so.2	libcublas.so.2.1  libcufftemu.so	  libcufft.so.2

libcublasemu.so.2.1  libcudart.so	  libcufftemu.so.2	libcufft.so.2.1

libcublas.so		 libcudart.so.2	libcufftemu.so.2.1

[frank@localhost ~]$ ls /usr/lib/nvidia

libcuda.so		   libGL.so.1			   libnvidia-tls.so.177.82

libcuda.so.1		 libGL.so.177.82		  libXvMCNVIDIA_dynamic.so.1

libcuda.so.177.82	libnvidia-cfg.so.1	   libXvMCNVIDIA.so.177.82

libGLcore.so.1	   libnvidia-cfg.so.177.82  tls

libGLcore.so.177.82  libnvidia-tls.so.1

[frank@localhost ~]$ set | grep PATH

LD_LIBRARY_PATH=/usr/local/cuda/lib

PATH=/usr/lib/qt-3.3/bin:/usr/kerberos/bin:/usr/lib/ccache:/usr/local/bin:/usr/bin:/bin:/usr/local/sbin:/usr/sbin:/sbin:/usr/local/cuda/bin:/home/frank/bin

[frank@localhost ~]$ ldconfig -p | grep cuda

	libicudata.so.40 (ELF) => /usr/lib/libicudata.so.40

	libcufftemu.so.2 (libc6) => /usr/local/cuda/lib/libcufftemu.so.2

	libcufftemu.so (libc6) => /usr/local/cuda/lib/libcufftemu.so

	libcufft.so.2 (libc6) => /usr/local/cuda/lib/libcufft.so.2

	libcufft.so (libc6) => /usr/local/cuda/lib/libcufft.so

	libcudart.so.2 (libc6) => /usr/local/cuda/lib/libcudart.so.2

	libcudart.so (libc6) => /usr/local/cuda/lib/libcudart.so

	libcuda.so.1 (libc6) => /usr/lib/nvidia/libcuda.so.1

	libcuda.so (libc6) => /usr/lib/nvidia/libcuda.so

	libcublasemu.so.2 (libc6) => /usr/local/cuda/lib/libcublasemu.so.2

	libcublasemu.so (libc6) => /usr/local/cuda/lib/libcublasemu.so

	libcublas.so.2 (libc6) => /usr/local/cuda/lib/libcublas.so.2

	libcublas.so (libc6) => /usr/local/cuda/lib/libcublas.so
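
A possible reading of the printouts above, offered as an assumption rather than a confirmed diagnosis: `LD_LIBRARY_PATH` and the `ldconfig` cache are consulted by the runtime loader, while `ld` resolves `-lcuda` only through its `-L` directories and its built-in defaults. Since `libcuda.so` lives in /usr/lib/nvidia here, the link step may simply never look there. The sketch below finds a directory that contains `libcuda.so` and prints the matching `-L` flag. `cuda_link_flag` is a hypothetical helper, and every candidate directory other than /usr/lib/nvidia is a guess.

```shell
# Print a -L flag for the first candidate directory that contains libcuda.so.
# cuda_link_flag is a hypothetical helper, not part of the CUDA toolkit or SDK.
cuda_link_flag() {
    for dir in "$@"; do
        if [ -e "$dir/libcuda.so" ]; then
            printf '%s\n' "-L$dir"
            return 0
        fi
    done
    return 1
}

# Check the driver directory from the listing first, then some common system
# directories (assumed candidates; adjust for your distribution).
cuda_link_flag /usr/lib/nvidia /usr/lib64/nvidia /usr/lib /usr/lib64 \
    || echo "libcuda.so not found in the candidate directories" >&2
```

The printed flag would then have to reach the link line of the driver API projects. Where to add it depends on the SDK makefiles, which are not shown in this thread.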

After removing the three projects that don’t compile, the rest compiles fine (barring a few compiler warnings). However, when I run the programs, I get the following error messages (I ran all of them and have pasted each distinct message I get):

oceanFFT

CUDA FFT Ocean Simulation

Left mouse button		  - rotate

Middle mouse button		- pan

Left + middle mouse button - zoom

'w' key					- toggle wireframe

cudaSafeCall() Runtime API error in file <oceanFFT.cpp>, line 273 : unknown error.

alignedTypes

Allocating memory...

Generating host input data array...

Uploading input data to GPU memory...

Testing misaligned types...

uint8...

cutilCheckMsg() CUTIL CUDA error: testKernel() execution failed

 in file <alignedTypes.cu>, line 223 : invalid device function .

asyncAPI

time spent executing by the GPU: 166.93

time spent by CPU in CUDA calls: 0.08

CPU executed 14899 iterations while waiting for GPU to finish

--------------------------------------------------------------

Test FAILED

bicubicTexture

Loaded 'lena_bw.pgm', 512 x 512 pixels

cudaSafeCall() Runtime API error in file <bicubicTexture.cu>, line 500 : invalid texture reference.

binomialOptions

Using single precision...

Using device 0: Tesla C870

Generating input data...

Running GPU binomial tree...

cudaSafeCall() Runtime API error in file <binomialOptions_kernel.cuh>, line 187 : invalid device symbol.

imageDenoising

Allocating host and CUDA memory and loading image file...

Loading ./../../../projects/imageDenoising/data/portrait_noise.bmp...

BMP width: 320

BMP height: 408

BMP file loaded successfully!

Data init done.

Initializing GLUT...

Loading extensions: No error

OpenGL window created.

Creating GL texture...

Texture created.

Creating PBO...

cudaSafeCall() Runtime API error in file <imageDenoisingGL.cpp>, line 407 : unknown error.

recursiveGaussian

This sample needs a card capable of OpenGL and display.

Please choose a different device with the -device=x argument.

simpleCUBLAS

simpleCUBLAS test running..

!!!! kernel execution error.

simpleCUFFT

cufft: ERROR: /root/cuda-stuff/sw/rel/gpgpu/toolkit/r2.1/cufft/src/execute.cu, line 1070

cufft: ERROR: CUFFT_EXEC_FAILED

cufft: ERROR: /root/cuda-stuff/sw/rel/gpgpu/toolkit/r2.1/cufft/src/cufft.cu, line 151

cufft: ERROR: CUFFT_EXEC_FAILED

cufftSafeCall() CUFFT error in file <simpleCUFFT.cu>, line 127.
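
A survey like the one above can be repeated after each change to the setup with a small loop. This is a minimal sketch, assuming the binaries sit in the SDK's bin/linux/release directory, as the make output earlier suggests. `run_samples` is a hypothetical helper and the log directory is arbitrary. It judges each sample by exit status only, which may miss a run that prints "Test FAILED" but exits with 0, so the logs still need reading. Samples that open an OpenGL window may not return on their own.

```shell
# Run every executable in a directory and report PASS/FAIL by exit status.
# run_samples is a hypothetical helper; both arguments are chosen by the caller.
run_samples() {
    bin_dir=$1
    log_dir=$2
    mkdir -p "$log_dir"
    for exe in "$bin_dir"/*; do
        # Skip anything that is not a regular, executable file.
        [ -f "$exe" ] && [ -x "$exe" ] || continue
        name=$(basename "$exe")
        # stdin comes from /dev/null so a sample that reads from stdin does not block;
        # stdout and stderr go to a per-sample log file.
        if "$exe" </dev/null >"$log_dir/$name.log" 2>&1; then
            echo "PASS $name"
        else
            echo "FAIL $name"
        fi
    done
}

# Example call (paths are assumptions based on the make output in this post):
# run_samples "$HOME/NVIDIA_CUDA_SDK/bin/linux/release" /tmp/sdk-logs
```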

deviceQuery and bandwidthTest run fine:

There is 1 device supporting CUDA

Device 0: "Tesla C870"

  Major revision number:						 1

  Minor revision number:						 0

  Total amount of global memory:				 1610350592 bytes

  Number of multiprocessors:					 16

  Number of cores:							   128

  Total amount of constant memory:			   65536 bytes

  Total amount of shared memory per block:	   16384 bytes

  Total number of registers available per block: 8192

  Warp size:									 32

  Maximum number of threads per block:		   512

  Maximum sizes of each dimension of a block:	512 x 512 x 64

  Maximum sizes of each dimension of a grid:	 65535 x 65535 x 1

  Maximum memory pitch:						  262144 bytes

  Texture alignment:							 256 bytes

  Clock rate:									1.35 GHz

  Concurrent copy and execution:				 No

Test PASSED

I do know that Fedora 10 is not officially supported, but it did run on my Fedora 8 installation (with Toolkit and SDK 1.1). I really have no idea how to proceed from this point. Anyone?

E.D_Riedijk · December 6, 2008, 8:21am

Install Fedora Core 9?

Spaceknarf · December 6, 2008, 8:43am

Well, I’d like to try it on 10 first. The fact that 10 is not officially supported doesn’t mean that it absolutely won’t work.

E.D_Riedijk · December 6, 2008, 11:40am

Well, your post seems to indicate it does not work ;)
One thing I noticed is that all the projects that fail to compile use the driver API.
Projects using OpenGL will likely not run for you, as there was some change in 2.1. For me they only run when using a GPU that can do OpenGL itself (in my case an 8800GTX). Running these samples on a Tesla will not work at this time (I think the 2.1 SDK will fix that).

One way to make it work in FC10 could be to install the gcc from FC9. But I am afraid that will open up a whole can of worms with libraries and stuff, so if you do not need new features from FC10…

Good to see more people from NL doing CUDA. Are you doing medical imaging? (I was doing MRI stuff when I was at uni and have seen lots of CUDA related papers about MRI the last year and a half)

DexterMorgan103 · December 20, 2008, 2:29am

I am also running Fedora 10. I have had success compiling the files as well as running the executables, so it does work to a degree. What I noticed in your setup that is different from mine is that you are declaring your paths differently.

PATH=$PATH:/cudalocation/bin
LD_LIBRARY_PATH=$LD_LIBRARY_PATH:/cudalocation/lib

export PATH
export LD_LIBRARY_PATH

is how I do it. Then go to the project you want to compile and make sure you build a release version using make release=1. Note that you must be in the directory where the Makefile for your project is. Sometimes when you try to make it you will get Error 1 or Error 2. Set your paths again and you should be fine.
Now the only thing is that when I get the API projects to compile I only get one frame per second lol. I will let you know if I make progress. So I think the previous post might be right… but I won’t give up on Fedora 10 yet and neither should YOU!
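
For anyone who ends up re-running the path setup, here is a minimal sketch of an append that is safe to repeat, because it skips a directory that is already listed. It uses `LD_LIBRARY_PATH`, which is the variable the dynamic loader reads and the one set in the first post. `append_dir` is a hypothetical helper, and /usr/local/cuda is assumed as the install prefix because that is where the first post's listing shows the toolkit.

```shell
# Append a directory to a colon-separated path variable unless it is already listed.
# append_dir is a hypothetical helper; pass a variable name and a directory.
append_dir() {
    var_name=$1
    dir=$2
    # Read the current value of the named variable (empty if unset).
    eval "current=\${$var_name:-}"
    case ":$current:" in
        *":$dir:"*) ;;  # already present, nothing to do
        *) eval "$var_name=\${current:+\$current:}\$dir" ;;
    esac
    export "$var_name"
}

# Assumed install prefix: /usr/local/cuda (taken from the first post's listing).
append_dir PATH /usr/local/cuda/bin
append_dir LD_LIBRARY_PATH /usr/local/cuda/lib
```

Putting these lines in a shell startup file such as ~/.bashrc would apply them to every new shell, so they would not need to be run again by hand.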
