Calling compiled CUDA files from Python

pwnagecorp2 · February 8, 2018, 7:41am

Based on this tutorial, it seems to be working perfectly… on UNIX.

http://bikulov.org/blog/2013/10/01/using-cuda-c-plus-plus-functions-in-python-via-star-dot-so-and-ctypes/

But I am on Windows, and after compiling a .cu file into a .so, .dll, or .windll file, it seems uncallable from Python. It definitely loaded as a WinDLL object, since it gave me a handle and an address, but the functions inside the compiled file can't be called.

mydll
<WinDLL 'C:.…kernel.so', handle 7ffb07040000 at 0x19a5380c3c8>

I can’t call its functions nor find how to utilize it after scraping the internet and going through the python ctypes documentation several hundred times. What can be done?

any help is appreciated :)

PS I tried using both extern "C" { … } as well as extern "C" cudamain{ … } and other function names, to no avail.

Robert_Crovella · February 8, 2018, 8:52am

Before tackling Python via ctypes, I would suggest getting a Windows DLL project working correctly in Visual Studio. There are plenty of examples on the internet of how to create a Windows DLL that contains CUDA code. Once you've got that syntax figured out, it will demonstrate that the embedded functions in the DLL are visible/callable, and my guess is you will just sail through the Python/ctypes part at that point.

A quick google search of “cuda dll” turned this up as one of the top hits:

c++ - Creating Cuda dll and using it on VC++ project - Stack Overflow

which seems to give a fairly complete example.

pwnagecorp2 · February 16, 2018, 12:53pm

Thanks for the suggestion. I've compiled a few projects in VS from the CUDA samples, some of which contain 'extern "C"' in the .cpp file, and still had no success accessing the .dll files from Python ctypes.

I found that the exported .ptx file contains the function names, but calling those didn't work either, even after adding the "shared" option to the compilation settings.

Is this a Windows limitation? nvcc doesn't have the "-fPIC" option on Windows like it has on Mac… At this point I've given up and figured I'll just use C++ to drive the CUDA files instead of Python, but it's a bit more of a hassle.

current VS compilation options:

export to .dll
shared
export .ptx file

is there something I’m missing??

Robert_Crovella · February 17, 2018, 4:06pm

Suggestion:

1. Learn how to compile and use a Windows DLL containing CUDA C++ code, using an ordinary C (non-CUDA) interface into the DLL. There are examples of this all over the web. Demonstrate that you can get this working.
2. Learn how to compile and use a Windows DLL (no CUDA) using the ctypes interface from Python. This has nothing to do with CUDA. Demonstrate that you can get this working.

You shouldn’t have to export or mess with PTX to do any of this.

If you can get those 2 things working, it should be a relatively straightforward matter to get the combination working.
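As a rough illustration of the second step (ctypes with no CUDA involved), here is a minimal sketch that loads the platform's C runtime and calls `strlen`. The helper name `load_c_runtime` is made up for this example, and the choice of `msvcrt` on Windows and `CDLL(None)` elsewhere is an assumption about a typical setup rather than anything from this thread.

```python
import ctypes
import sys


def load_c_runtime():
    """Load the platform's C runtime as an ordinary ctypes library (hypothetical helper)."""
    if sys.platform == "win32":
        # msvcrt.dll ships with Windows and exports the C standard library.
        return ctypes.CDLL("msvcrt")
    # On POSIX, CDLL(None) exposes the symbols already loaded into the
    # current process, which includes the C library.
    return ctypes.CDLL(None)


libc = load_c_runtime()

# Declare the C signature so ctypes converts arguments and results correctly:
#     size_t strlen(const char *s);
libc.strlen.argtypes = [ctypes.c_char_p]
libc.strlen.restype = ctypes.c_size_t

length = libc.strlen(b"hello from ctypes")
print(length)
```

If this pattern works against a plain DLL, the same `argtypes`/`restype` declarations should carry over to a DLL that happens to contain CUDA code behind a C interface.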

pwnagecorp2 · February 24, 2018, 9:26pm

Thanks a lot, I finally got it working! Turns out my .cu file was missing __declspec(dllexport).

For anyone looking at this in the future, you can put this at the top of your CUDA code:

#define DLLEXPORT extern "C" __declspec(dllexport)

and put DLLEXPORT in front of every function you want to be accessible from Python ctypes. Just make sure you compile it to a shared library (use the -shared flag if you are running nvcc on the command line).
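To make that concrete, here is a sketch of what such a .cu file and its ctypes wrapper might look like. Everything named here (`vector_add`, `add_kernel`, `write_kernel_source`, `call_vector_add`) is hypothetical and not taken from the original code; the `#ifdef _WIN32` guard is an addition so the same source could also build on other platforms, and error handling is kept to a bare minimum.

```python
import ctypes

# Hypothetical CUDA source: one kernel plus one exported C wrapper around it.
KERNEL_SOURCE = r"""
#include <cuda_runtime.h>

#ifdef _WIN32
#define DLLEXPORT extern "C" __declspec(dllexport)
#else
#define DLLEXPORT extern "C"
#endif

__global__ void add_kernel(const float *a, const float *b, float *out, int n)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) out[i] = a[i] + b[i];
}

// Plain C entry point: takes host pointers, returns a cudaError_t as int.
DLLEXPORT int vector_add(const float *a, const float *b, float *out, int n)
{
    float *d_a, *d_b, *d_out;
    size_t bytes = n * sizeof(float);
    cudaMalloc(&d_a, bytes);
    cudaMalloc(&d_b, bytes);
    cudaMalloc(&d_out, bytes);
    cudaMemcpy(d_a, a, bytes, cudaMemcpyHostToDevice);
    cudaMemcpy(d_b, b, bytes, cudaMemcpyHostToDevice);
    add_kernel<<<(n + 255) / 256, 256>>>(d_a, d_b, d_out, n);
    cudaError_t err = cudaMemcpy(out, d_out, bytes, cudaMemcpyDeviceToHost);
    cudaFree(d_a);
    cudaFree(d_b);
    cudaFree(d_out);
    return (int)err;
}
"""


def write_kernel_source(path):
    """Write the example CUDA source to `path` and return the path."""
    with open(path, "w") as f:
        f.write(KERNEL_SOURCE)
    return path


def call_vector_add(dll_path, a, b):
    """Load the compiled library and call the exported vector_add wrapper."""
    lib = ctypes.CDLL(dll_path)
    float_ptr = ctypes.POINTER(ctypes.c_float)
    lib.vector_add.argtypes = [float_ptr, float_ptr, float_ptr, ctypes.c_int]
    lib.vector_add.restype = ctypes.c_int

    n = len(a)
    FloatArray = ctypes.c_float * n
    out = FloatArray()
    status = lib.vector_add(FloatArray(*a), FloatArray(*b), out, n)
    if status != 0:
        raise RuntimeError(f"vector_add returned CUDA error code {status}")
    return list(out)
```

Only the wrapper carries DLLEXPORT; the `__global__` kernel stays internal to the library, so Python never sees anything but an ordinary C function. `call_vector_add` needs a compiled library and a CUDA-capable GPU, so it is defined here but not invoked.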

Also along the way I learned that you can circumvent the entire Visual Studio process, with a few lines in Python! Just write this:

import subprocess

# Some of these options are specific to my machine, e.g. -arch=sm_61
# (compute capability 6.1), since I have a 1050 Ti.
nvcc_options_dll = ['nvcc', '-shared', r"C:\some_path\kernel.cu", '-O3', '--ftz=true', '--fmad=true', '-arch=sm_61', '-o', r"C:\some_path\cresult.dll"]

# Run nvcc and capture its output so compiler messages can be inspected.
compile_cuda = subprocess.Popen(
    nvcc_options_dll, stdin=subprocess.PIPE,
    stdout=subprocess.PIPE, stderr=subprocess.PIPE)
out, err = compile_cuda.communicate()

and you will find a .dll in your directory, from where you can proceed with the first tutorial. I found this handy trick digging inside the source code for cuda4py, an amazing module.
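One thing the snippet above does not do is check whether nvcc actually succeeded. A slightly more defensive variant might look like the sketch below; the function names `build_nvcc_command` and `compile_cuda_dll` are made up for illustration, and the default `sm_61` architecture is just the value from this thread, so adjust it for your own GPU.

```python
import shutil
import subprocess


def build_nvcc_command(source, output, arch="sm_61", extra=("-O3",)):
    """Assemble the nvcc argument list for building a shared library."""
    return ["nvcc", "-shared", source, *extra, f"-arch={arch}", "-o", output]


def compile_cuda_dll(source, output, **kwargs):
    """Run nvcc and raise with the compiler's stderr if the build fails."""
    if shutil.which("nvcc") is None:
        raise FileNotFoundError("nvcc was not found on PATH")
    cmd = build_nvcc_command(source, output, **kwargs)
    result = subprocess.run(cmd, capture_output=True, text=True)
    if result.returncode != 0:
        raise RuntimeError(f"nvcc failed:\n{result.stderr}")
    return output
```

A call such as `compile_cuda_dll(r"C:\some_path\kernel.cu", r"C:\some_path\cresult.dll")` would then either return the output path or raise with nvcc's error text. On Windows, nvcc also needs the Visual C++ compiler (cl.exe) to be reachable, so running this from a Visual Studio developer prompt is probably the easiest route.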

Cheers, and happy CUDAing!
