Links to CUDA development tools

Please refer to this page for a reasonably comprehensive list of development tools, libraries, and plugins for GPU computing using CUDA-enabled GPUs:

[url="http://www.nvidia.com/object/tesla_software.html"]http://www.nvidia.com/object/tesla_software.html[/url]

If we missed something, please post it on this thread.

I think it would be useful to link to MisterAnderson42's GPUWorker class, which spawns host threads to make multi-GPU programs easier to manage. Unfortunately, it seems to be referenced only in the forum and inside the source code for HOOMD.

Perhaps he could be persuaded to make a homepage for GPUWorker, and then you could link to that.

Komrade: a pretty neat C++ library for CUDA with a very silly name

There is this debugging tool from the University of Oxford for viewing and comparing the contents of host and device memory.

[url="http://www.oerc.ox.ac.uk/research/many-core-and-reconfigurable-supercomputing/memviewer"]http://www.oerc.ox.ac.uk/research/many-core-and-reconfigurable-supercomputing/memviewer[/url]
[img]http://www.oerc.ox.ac.uk/personal-pages/daniel/MemView-Image-5.png[/img]

[quote name='worc1154' post='548755' date='Jun 4 2009, 08:19 AM']
There is this debugging tool from the University of Oxford for viewing and comparing the contents of host and device memory.
[/quote]

I couldn’t get the link provided to work.

Here is an updated URL for the MemViewer tool: http://www.oerc.ox.ac.uk/research/many-core-and-reconfigurable-supercomputing/memviewer

Ocelot (formerly hosted on Google Code; the project now lives in the Google Code Archive) is an alternative to deviceemu. It executes CUDA programs one instruction at a time, as they would execute on a GPU with a very large warp size.

It has built-in memory checking functionality that will detect if you use a host pointer in device code or write to memory that has not been allocated.

AgPerfMon is a tool mostly for PhysX and graphics programmers, but it also reveals some low-level CUDA kernel scheduling. It records timestamps and the SM and warp IDs of running kernels, and shows them on a timeline.

An Eclipse plugin for CUDA and/or Qt development and compilation:

[url="http://www.ai3.uni-bayreuth.de/software/eclipsecudaqt/index.php"]http://www.ai3.uni-bayreuth.de/software/eclipsecudaqt/index.php[/url]

There are a few more that you should add:

  • Full support for .Net (full CUDA driver API access and more) (C# and Visual Basic Examples)

  • Full support for Perl (full CUDA driver API access and more–see below)

  • Full support for Python (full CUDA driver API access and more–see below)

  • Full access for Ruby to run CUDA via the CUDA driver API

  • Full access for Lua to run CUDA via the CUDA driver API

  • Source code for all Kappa library language bindings and keywords is available via the Kappa library installers.

Performance is usually comparable to C++ even though this is a high-level interface, since most CUDA API operations, such as memory management and transfers, are performed by the Kappa C++ library. (Performance can be better than any single CUDA C/C++ SDK example, since CUDA best practices such as memory mapping and concurrent kernel execution are the default when the GPU hardware supports them.) Full multi-GPU support and CUDA JIT compilation are available for all language bindings.

Since the Kappa library uses a producer/consumer data-flow scheduler, defaults to asynchronous CUDA kernel launches, and supports asynchronous CPU kernel and SQL operations, it can achieve full occupancy of both CPU and GPU. On GF100 GPUs, kernel launches are arranged so that concurrent kernel execution is automatic and the usual mode, assuming the GPU has occupancy available for that mixture of kernels. Whether CUDA kernels actually execute concurrently then becomes a (potentially nondeterministic) result of the runtime dynamics of the host and GPU code, but performance should always meet or exceed what is otherwise available.

For .Net, you can create .Net subclass instances to tie to the Kappa IO keyword and to receive exception notifications. These subclasses execute on the host thread associated with the GPU context, so the full CUDA API is accessible for that GPU context.

With the Perl and Python bindings mentioned above, developers can use a mixture of CUDA C++ running on the GPU and C++ (including OpenMP), Perl, or Python running on the host as a single integrated processing task.

Additional language bindings (untested; no examples) are available for invoking CUDA via the Kappa library from: Java, R, PHP, Octave/Matlab, TCL, allegrocl, chicken, guile, mzscheme, ocaml, and pike.

The Kappa library is commercial, but the .Net, Perl, Python, Lua, Ruby, etc. modules/packages, examples, and keyword source code are available under the MIT License.

CUVI Lib v0.3 (Beta version) is a new library from TunaCode. You can download a copy from:

[url="http://www.cuvilib.com/downloads"][b]http://www.cuvilib.com/downloads[/b][/url]

CUVI Lib (CUDA for Vision and Imaging Lib) is an add-on library for NPP (NVIDIA Performance Primitives) and includes several advanced computer vision and image processing functions presently not available in NPP.

In the current release of CUVI Lib you will find:

  • Optical Flow (Horn & Shunck)
  • Optical Flow (Lucas & Kanade)
  • Discrete Wavelet Transform (Forward and Inverse)
  • Hough Transform
  • Hough Lines (Lines Detector)
  • Color Conversion (RGB-to-Gray and RGBA-to-Gray)

Several more advanced features will be added to CUVI Lib in upcoming releases. A detailed function reference can be downloaded from:
www.cuvilib.com/cuvimanual.pdf

We look forward to hearing your feedback and guidance on our forums ([url="http://www.cuvilib.com/forums"][b]http://www.cuvilib.com/forums[/b][/url]), and we look forward to making CUVI Lib a single complete source of computer vision and image processing functions implemented on the GPU.

How does the binding work on it?

Links to the CUDA 32-bit and 64-bit toolkits do not work: the result is a nearly blank page with a File Not Found message.

An open source project, SGC Ruby CUDA, is available on GitHub (xman/sgc-ruby-cuda) and in the standard Ruby Gems repository.

It provides access to the CUDA API from Ruby programs.

CUDA Eclipse plugin: Fixstars Corporation
Yellow Dog Linux, tailored for CUDA development: Fixstars Corporation