Confusing results attempting to run CUDA app on target Linux systems

MarkP · March 30, 2009, 8:48pm

Hi,
I have added a CUDA-based GPU kernel as an alternate computation path for CUDA-capable target systems. In trying to understand how this will work I have found that I seem to get the expected “device does not support cuda” when running the app on an older system with non-cuda-capable GPU. However, I have also encountered a case in which the program is seemingly forced into emulation mode “Using device 0: Device emulation (CPU)” on a system which should be CUDA-capable. Is this an incompatability between my program and the target system or an obvious indicator that it can not support a CUDA code?

The system in question includes a GeForce 9500 GT, runs open SUSE 11.1 x86_64 Linux, and contains /usr/lib64/libcuda.so.180.22.
I build the test app on a RHEL5.1 x86_64 system and copy the app, libcudart.so.2, libcufft.so.2 libs to the target system.
ldd indicates all .so references are being satisfied.

This program will be run on by various users on several different vintages of Linux, NVIDIA drivers, and GPUs over which I will have no control. So understanding how/why it elicits different responses is quite important.

tmurray · March 30, 2009, 9:13pm

What compiler did you build with?

MarkP · March 30, 2009, 9:23pm

I used nvcc “release 2.0, V0.2.1221”

gcc used was 4.1.2 20071124

tmurray · March 30, 2009, 9:26pm

Hmmmm… so if you used CUDA 2.0 to build, included all .so files, and did everything with the pathing right, you should have no problems. However, I remember one thing in my dealings with SuSE (haven’t used it in a long time, so grain of salt required)–check that /dev/nvidia* have 0666 permissions. I remember that I had to do this on SuSE but on no other distribution ever, so if you’re trying to do this over ssh that might be the problem.

MarkP · March 30, 2009, 9:47pm

You may well be right. The current permissions are only 660 . Unfortunately, I need to wait until someone with root privileges is available to change it.
However, I think you’ve answered my question already: what I did (builds, .so’s, ldd check, etc.) should have been enough to get a non-Emu execution.

I’ll definitely reply back, since your info should be published as it infers a general gotcha for using GPU’s remotely on SUSE.

Any idea what “/dev/nvidiactl” is as opposed to “/dev/nvidia0” ? On my RHEL5.1 build system I see 666 for nvidiaia0 but only 600 for nvidiactl – and that works fine, since I’m shown as the owner for /dev/nvidiactl.

tmurray · March 30, 2009, 9:54pm

/dev/nvidiaN is each card, while /dev/nvidiactl is the system-wide management interface (I think).

MarkP · March 31, 2009, 5:06am

Turns out the incomplete permissions on /dev/nvidia* was only the first of two problems. In addition there was a missing symlink of libcuda.so to libcuda.so.1 in /usr/lib64 on the target system. (A little detective work with strace allowed me to find that.) No idea if the missing symlink is another SUSE-ism or operator error during the NVIDIA driver install on that system.
In any event it appears that for the target SUSE 11.1 system permissions need to be opened on the /dev/nvidia* devices and/or all users must be made a member of the “video” group. That would seem to be a general weakness with GPGPU useage thereon.

Topic		Replies	Views
CUDA Works as root but not as user on OpenSUSE 10.3 CUDA Programming and Performance	5	7933	November 6, 2008
Cuda without geforce CUDA Programming and Performance	9	5473	October 3, 2008
There is no device supporting CUDA CUDA Programming and Performance	11	22752	April 24, 2008
My system see only emulator but CUDA works CUDA Programming and Performance	4	5641	July 9, 2008
CUDA on [Non-supported] distros? CUDA Programming and Performance	19	23355	May 8, 2007
Another beginner question: CUDA on SUSE 11.0 CUDA Programming and Performance	6	3619	September 4, 2008
Getting started with CUDA initial problems on suse CUDA Programming and Performance	2	5226	March 14, 2007
Problems with compilation of SDK/suse10.2 CUDA Programming and Performance	2	4660	October 30, 2007
Possible to use CUDA with Ubuntu 8.04? with geforce 7600GT? CUDA Programming and Performance	13	22618	May 27, 2008
Emulation on Linux: basic questions CUDA Programming and Performance	9	13016	June 4, 2009

Confusing results attempting to run CUDA app on target Linux systems

Related topics