Extending CUDA API Allocating "used" host memory

sietsch · September 22, 2009, 11:34am

Hi there,

Imagine someone is missing a certain functionality in CUDA.
Presuming good programming skills, how would one implement, let's say, an extension for CUDA, either by extending the CUDA API itself or by writing an additional, rather small API?

Thanks for dropping me your thoughts and opinions,
Sietsch.

Gregory_Diamos · September 22, 2009, 6:18pm

This depends on whether you need access to the internal data structures used by the runtime. For example, if you need access to the allocated memory maps on the device, you would have to reimplement either the runtime API or the driver API. If the functionality you need does not have to interact with the internals of the current API, it is much simpler to just create a library of your own. If you want to see what it takes to implement the CUDA API from scratch, you could take a look at some of our code here:

http://code.google.com/p/gpuocelot/source/…implementation/

specifically

http://code.google.com/p/gpuocelot/source/…aRuntimeApi.cpp

and

http://code.google.com/p/gpuocelot/source/…RuntimeBase.cpp

You might want to take a look at lines 1614-1643. We added two additional API calls to allow trace generators to be bound to kernels as they are launched.
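
To illustrate the "library of your own" route, here is a minimal sketch of a small add-on layer that keeps its own table of allocations instead of reaching into the runtime's internal memory maps. Everything in the `cudaext` namespace (`TrackedAllocator`, `hostAlloc`, `hostFree`) is a hypothetical name made up for this example and is not part of CUDA or Ocelot. The underlying allocator is passed in as a function pointer so the sketch compiles and runs on the host alone; in a real build you would presumably pass thin adapters around `cudaMalloc` and `cudaFree` instead.

```cpp
#include <cstddef>
#include <cstdlib>
#include <map>

// Hypothetical add-on library; none of these names exist in CUDA itself.
namespace cudaext {

// Signatures of the underlying allocator. Assumption: in a real build these
// would be small adapters around cudaMalloc/cudaFree.
using AllocFn = void* (*)(std::size_t);
using FreeFn  = void (*)(void*);

// Host-only stand-ins so the sketch runs without a GPU.
inline void* hostAlloc(std::size_t bytes) { return std::malloc(bytes); }
inline void  hostFree(void* ptr)          { std::free(ptr); }

// Records every allocation made through this wrapper. It cannot see
// allocations made by calling the underlying API directly.
class TrackedAllocator {
public:
    TrackedAllocator(AllocFn allocFn, FreeFn freeFn)
        : alloc_(allocFn), free_(freeFn), total_(0) {}

    // Free anything the caller forgot to release.
    ~TrackedAllocator() {
        for (auto& entry : live_) free_(entry.first);
    }

    // Copying would lead to double frees, so forbid it.
    TrackedAllocator(const TrackedAllocator&) = delete;
    TrackedAllocator& operator=(const TrackedAllocator&) = delete;

    // Allocate and remember the size; returns nullptr on failure.
    void* allocate(std::size_t bytes) {
        void* ptr = alloc_(bytes);
        if (ptr != nullptr) {
            live_[ptr] = bytes;
            total_ += bytes;
        }
        return ptr;
    }

    // Release a tracked pointer; returns false if it is unknown to us.
    bool release(void* ptr) {
        auto it = live_.find(ptr);
        if (it == live_.end()) return false;
        total_ -= it->second;
        free_(ptr);
        live_.erase(it);
        return true;
    }

    std::size_t bytesInUse() const      { return total_; }
    std::size_t liveAllocations() const { return live_.size(); }

private:
    AllocFn alloc_;
    FreeFn  free_;
    std::map<void*, std::size_t> live_;  // pointer -> size in bytes
    std::size_t total_;                  // sum of all live sizes
};

}  // namespace cudaext
```

A caller would construct `cudaext::TrackedAllocator alloc(cudaext::hostAlloc, cudaext::hostFree);` and then use `alloc.allocate(n)` and `alloc.release(p)` in place of the raw calls. The limitation is the one described above: the table only covers allocations that go through the wrapper, so anything that needs the runtime's own view of device memory still requires reimplementing the runtime or driver API.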
