Invalid device function

muxa · November 16, 2008, 7:38pm

hi2all

CUDA ref. manual says that return value of cudaLaunch func. is one of the following:
[font=“Courier New”]cudaSuccess
cudaErrorInvalidDeviceFunction
cudaErrorInvalidConï¬guration[/font]
my question is What can cause cudaErrorInvalidDeviceFunction as a result of a kernel call?

I wrote a multi-gpu code, which works on every CUDA-capable PC except one.
On “WinXP, 2x GTX 280”-machine program exits with cudaErrorInvalidDeviceFunction?

does anyone of you guys know where can be my problem?

alex_dubinsky · November 17, 2008, 1:08am

Did you see this other thread: [url=“http://forums.nvidia.com/index.php?showtopic=80370&st=0&p=456174&#entry456174”]http://forums.nvidia.com/index.php?showtop...mp;#entry456174[/url]

“Solved. There was another cuda call initializing a pbo which was wrong and somehow killed cuda. The code up there now works fine.”

muxa · November 17, 2008, 6:07am

I did. So, the reason why I get cudaErrorInvalidDeviceFunction as a result of a kernel call is that device is “busy”. Some other code running a device, or even mine. Right?

Or, this is one of all possible reasons why i can get this error.

alex_dubinsky · November 17, 2008, 7:55am

That person seems to say that an incorrect call corrupted the GPU’s memory, possibly messing up the kernel’s code or just scrambling the nvidia driver’s state. Possibly, it could be an ordinary out-of-bounds access that is scrambling driver memory. I don’t know if this is what you’re seeing, but overwriting-memory bugs tend to be semi-non-deterministic and may manifest themselves in one configuration but not another.

For sake of completeness, what are those other systems on which your code does work?

muxa · November 17, 2008, 8:23am

cuda version:

2.0

nvcc version:

Built on Wed_Jul_16_12:57:50_PDT_2008

Release 2.0, V0.2.1221

does not work on:

WinXP, 4GB RAM, 2x GTX280 (any mode)

does work on:

Ubuntu, 4GB RAM, 2x GTX280 (NonEmu) (the same machine, but under WinXP there is a bug discussed above)

WinXP, 2 GB RAM, 8600 GTS. (NonEmu, Emu)

WinXP, 4GB, C2Duo 3GHz (Emu)

WinXP, 512MB, Intel 1.5GHz (Emu)

muxa · November 17, 2008, 8:48am

thanks, I’ll check that theory.

(but sdk-samples are ok)

alex_dubinsky · November 17, 2008, 6:46pm

Your code doesn’t work even in emu? Very strange indeed. EDIT: Hold on, driver api supports emulation now?

Also, do you recompile on every machine you run on, or do you have one binary (for the win machines)?

muxa · November 18, 2008, 6:18am

sorry, I was missinformed. in emu-mode everything works fine.

the code is always recompiled on every machine before run.

I also tried to run an empty kernel instead of existing in program by calling it with <<<dim3(1,1,1), dim3(1,1,1)>>> configuration. the same reaction on the FIRST kernel call.

upd: Who told you I am using driver API?

alex_dubinsky · November 18, 2008, 7:30pm

Recompiling is what typically makes out-of-bounds bugs surface or hide. You can try compiling the code on a windows machine where it works, and see if it’ll run on the machine that doesn’t. Although this won’t really tell you much.

Btw, is your windows machine with 4GB using an x64 OS?

Sorry, I thought you’d said you were using cuLaunch.

muxa · November 19, 2008, 12:46pm

Hmmm… Not sure if I get it. What do you mean?

Ok, I’ll check it. But I thought the resulting binary code does not depend on a machine it was compiled on (at least under windows-family OS).

Yeap. WinXP x64-version.

alex_dubinsky · November 19, 2008, 10:02pm

And are the machine on which it works x32? If so, this is an important point you should have said from the start. Probably, your error is in dealing with the x64 architecture. E.g., you do sizeof(long) instead of sizeof(void*) somewhere or somesuch. This causes an out-of-bounds access.

What I meant about “surface and hide” is that an out-of-bounds bug will sometimes appear during one compile, and not appear on another. That is how it’s not quite deterministic. Sometimes you may change a completely unrelated line of code, and it will toggle the bug.

Topic		Replies	Views
cudaErrorInvalidDeviceFunction Simple program throwing cudaErrorInvalidDeviceFunction error CUDA Programming and Performance	1	2513	April 24, 2010
invalid device function, all CUDA-capable devices are busy or unavailable CUDA Programming and Performance	5	7748	July 6, 2013
cudaErrorInvalidDeviceFunction CUDA Programming and Performance cuda , jetson	6	2700	September 26, 2022
Invalid Device Kernel after upgrade to Cuda 10.x CUDA Programming and Performance	9	1655	May 9, 2019
(error 98) due to "invalid device function" for a very simple templated kernel example CUDA Programming and Performance cuda , kernel	3	3512	July 8, 2020
Cuda Error #4 that requires PC Reboot, Help!!! CUDA Programming and Performance	17	9573	September 17, 2013
cudaErrorInvalidDeviceFunction: kernel fails to load, but stuck there CUDA Programming and Performance	6	63	July 19, 2024
Always got this warning when nvprof cuda file "This can happen if device ran out of memory or if a device kernel was stopped due to an assertion" on just HellowWorld GPU CUDA Programming and Performance	9	2557	January 31, 2019
Strange crashes in __device__ function CUDA Programming and Performance	4	1060	August 1, 2016
Error when using cudaLaunch cudaErrorInvalidDeviceFunction error CUDA Programming and Performance	1	4553	July 1, 2009

Invalid device function

Related topics