I am building a GPU data-structures library for use in kernels, providing simple memory allocation, spin-locks, and more advanced data structures.
In a separate project I am working on a real-time simulator implemented entirely in device code (to support a neural network), which will require these data structures.
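To make concrete the kind of "simple memory allocation" I mean, here is a minimal sketch of a device-side bump allocator over a fixed arena, advanced with atomicAdd. All names (g_arena, arena_alloc) are my own, not an existing API, and a real allocator would also need a free list; this only hands out memory, it never reclaims it.

```cuda
#define ARENA_BYTES (1 << 20)           // 1 MB arena, picked arbitrarily

__device__ unsigned char g_arena[ARENA_BYTES];
__device__ unsigned int  g_offset = 0;  // index of the next free byte

// Callable from any thread; each call atomically claims a unique region.
__device__ void *arena_alloc(unsigned int bytes)
{
    bytes = (bytes + 7) & ~7u;          // round up to 8-byte alignment
    unsigned int old = atomicAdd(&g_offset, bytes);
    if (old + bytes > ARENA_BYTES)
        return 0;                       // arena exhausted
    return g_arena + old;
}
```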
I know that memory allocation, hash tables, and spin-locks in device code are typically frowned upon; the current kernel-development tactic is to serialize on the host, parallelize on the device, and shuttle data between them. But my need is great and my data sets are large, so these objects will have to live and execute in device logic and memory.
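For reference, the device-side spin-lock I have in mind looks roughly like this (a hedged sketch with illustrative names, not a finished implementation). The well-known caveat is that if every thread in a warp tries to take the lock, the SIMT divergence can deadlock, so typically only one thread per warp or block should acquire it:

```cuda
__device__ int g_lock = 0;      // 0 = free, 1 = held

__device__ void lock(int *l)
{
    // Spin until we flip the lock from 0 to 1.
    while (atomicCAS(l, 0, 1) != 0)
        ;                       // busy-wait
}

__device__ void unlock(int *l)
{
    __threadfence();            // make protected writes visible first
    atomicExch(l, 0);           // release the lock
}
```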
From what I have learned so far, the only practical way of stitching this code together is in PTX with .func entities, because CUDA automatically inlines all function calls. And there seems to be no effective way of mixing .cu files with .func, or of combining multiple PTX files, since there is no preprocessor for PTX. But I may be way off, so please correct me.
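In case my terminology is unclear, this is the shape of PTX .func I mean (illustrative names, and my recollection of the syntax, so treat it as a sketch):

```
// a callable PTX function: takes one u32, returns it plus one
.func (.reg .u32 %ret) inc_one (.reg .u32 %a)
{
    add.u32  %ret, %a, 1;
    ret;
}

// call site inside a kernel body:
//   .reg .u32 %r1, %r2;
//   call (%r2), inc_one, (%r1);
```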
It is currently a Visual Studio 2008 project. I had to clone the CUDA build rules and modify one copy, adding the .ptx extension, to get Studio to compile my .ptx files. Is there a better way of doing this?
I am also planning a concurrent implementation for the ATI line with CAL files eventually. Sorry NVIDIA, yours is more dominant though.
Please provide any feedback.