Invalid Device Function

raftpeople · November 24, 2008, 6:51pm

I’m getting this error when launching a kernel. I have successfully compiled and executed sample programs (scalarProd), so I think I’m just doing something wrong. But before getting into too many details, will I get this error if I had the following because the kernel hasn’t had time to launch?:

testGPUKernel<<<32,32>>>();

	CUT_CHECK_ERROR("Kernel failed");

I understand that a memcpy from the device back to the host will wait for all threads to complete, is the appropriate time to check for kernel status after a memcpy back?

danijel · November 24, 2008, 7:12pm

How are you compiling your test program? I had some problems with compiling from within Visual Studio, so I created a Makefile and compiled from the console and everything worked fine…

raftpeople · November 24, 2008, 7:57pm

I’m using Visual Studio Express 2005, using Build Project. My project was created using the CUDAWinApp template/wizard. It compiles OK and runs fine if I’m only doing memcpy stuff, once I try to launch the kernel and do the CUT_CHECK_ERROR it tells me kernel failed. The project was originally a “hello world” project with all code in “main”, and it ran ok (the kernal created the “hello world” string). I left the main routine in place and just added my code by adding other routines, later I removed all of the code from “main” in an attempt to eliminate variables with this problem but it did not change anything. I assume that with a DLL the main routine is not ever executed?? (never created a DLL before).

Here are my other details, I didn’t post them originally because I don’t even know if it’s valid to perform CUT_CHECK_ERROR immediately after kernel launch or not.

WinXP (new box, assume SP2, but it’s not in front of me so can’t say for sure)

GTX280 (drivers downloaded 2 weeks ago)

Java 1.6.0_10

Visual Studio Express 2005, C++ (downloaded 2 weeks ago)

Java loads C++ DLL

Calls native routines using JNI

All tests between Java and C++ code work properly, can xfer data, perform calcs, return results, results match the same code in Java

Java call to C++ routine to allocate memory on device works, that is to say CUDA_SAFE_CALL doesn’t spit out any errors

Subsequent Java call to C++ to launch kernel fails with the “invalid device function”

At this point I’ve eliminated almost all of the code in the kernel and in the routine that launches the kernel, this is what they look like (source is at home, keying from memory, but I eliminated everything but what is shown):

__global__ void testGPUKernel() {

	__shared__ int a;

	a=1;

}

extern "C" JNIEXPORT __declspec(dllexport) jint JNICALL Java_TestGPUCalls_testNtvGPUCalc(JNIEnv *, jobject) {

	jint result;

	result=0;

	testGPUKernel<<<32,32>>>();

	CUT_CHECK_ERROR("Kernel failed");	(****** DON'T REMEMBER EXACT LINE OF CODE HERE, BUT IT WAS COPIED FROM ONE OF SAMPLE PGMS *****)

	return result;	

}

danijel · November 24, 2008, 8:13pm

Exactly. I tried to use the same wizard and had the same problem as you describe. I’m not sure if its the compiler settings that that wizard creates, but for some reasons the kernel functions fail.

I know this is not a solution, but it can help you determine the cause a bit easier. Here’s a Makefile I use in one of my projects:

[codebox]

CC=“C:\Program Files\Microsoft Visual Studio 8\VC\bin\cl.exe” /EHsc

NVCC=“C:\CUDA\bin\nvcc.exe”

VSBIN=“C:\Program Files\Microsoft Visual Studio 8\VC\bin”

LIBS=“C:\CUDA\lib\cudart.lib” “C:\Program Files\NVIDIA Corporation\NVIDIA CUDA SDK\common\lib\cutil32.lib” user32.lib

LIBDIR=“C:\Program Files\NVIDIA Corporation\NVIDIA CUDA SDK\common\lib”

LIBDIR2=“C:\CUDA\lib”

INCDIR=“C:\Program Files\NVIDIA Corporation\NVIDIA CUDA SDK\common\inc”

all: main.exe

main.exe: main.obj gpu.obj

$(CC) $(LIBS) main.obj gpu.obj

main.obj: main.cpp

$(CC) main.cpp -c

gpu.obj: gpu.cu

$(NVCC) -ccbin $(VSBIN) -I$(INCDIR) -c -o gpu.obj gpu.cu

clean:

del *.obj

[/codebox]

You might also want have to setup the system variables prior to compiling. The batch file that does this is in “C:\Program Files\Microsoft Visual Studio 8\VC\vcvarsall.bat” on my system.

I also attatched another simple project I made for testing.
test.zip (164 KB)

raftpeople · November 24, 2008, 8:40pm

Thanks, I’ll try it tonight. I’m a newb when it comes to DLL’s, and pretty rusty on C (it’s been a long time), do you know what I need to do to your makefile to instruct it to:

Create a DLL
Name the final executable myprojectname.dll

raftpeople · November 26, 2008, 7:57am

A follow up in case anyone else runs into this problem also.

danijel, thanks for your help, I used the makefile and your test code and it compiled and executed fine. Then I used your makefile to compile my code and it is now running just fine, no “invalid device function” anymore. I think your right, there is something with the CUDAWinApp project wizard that was causing the problem because I’m also able to build and execute the sample projects from NVIDIA through VS2005 without a problem, only the wizard app was having a problem.

I was able to stumble my way through passing CL options through NVCC to be able to create my DLL. What I don’t understand is why MS documentation says /LD creates a DLL but that gave me a link error, so I looked at the command line in VS and it showed /DLL, I tried passing that and it worked.

kyzhao · November 27, 2008, 5:46am

By default, for using debug you must set project → property → CUDA → Output → Intern mode: set Real <= Very important
sorry about it.
i set the default value for emudebug but not true device~~
so get this error~~ :">
i will change the default value in the next version soon.

thanks for your information…

raftpeople · November 27, 2008, 8:06am

Ok, thanks, I’ll give it another try.

Matt_Slezak · January 24, 2010, 2:35am

Good advice! Selecting the “real” mode gets rid of the error for me as well in Visual Studio.

swajnaut_cz · October 24, 2014, 2:14pm

I had the same error, turned out that I was compiling for architecture SM_50 while my GPU only supports SM_35.

Topic		Replies	Views
Invalid device function CUDA Programming and Performance	10	6816	February 25, 2015
Kernel is not being launched. SDK kernels get launched. Mine doesn't. CUDA Programming and Performance	4	2618	July 22, 2010
(error 98) due to "invalid device function" for a very simple templated kernel example CUDA Programming and Performance cuda , kernel	3	3518	July 8, 2020
Invalid Device Kernel after upgrade to Cuda 10.x CUDA Programming and Performance	9	1655	May 9, 2019
cudaErrorInvalidDeviceFunction CUDA Programming and Performance cuda , jetson	6	2708	September 26, 2022
Undefined reference to a CUDA function in a dll (MSVC 2017 + MingW) CUDA Programming and Performance	10	1916	May 18, 2018
Exception on first CUDA call/Kernel fails to run SDK and Memcpy work though.... CUDA Programming and Performance	0	6293	January 22, 2009
strange behavior with device emulation CUDA Programming and Performance	5	2693	May 20, 2008
Kernel Launch Failure Very simple kernel CUDA Programming and Performance	3	3890	September 14, 2011
Invalid device function CUDA Programming and Performance	10	6455	November 19, 2008

Invalid Device Function

Related topics