What is captured in a CUcontext? Contexts and the cost of module load

Can anyone list specifically what is captured in a CUcontext? I can see from examples that CUmodule and CUfunction can be bound to a CUcontext. Is there a complete list somewhere?

Also, what is the cost of performing a cuModuleLoad() and cuModuleGetFunction() inside a new context (created by cuCtxCreate) for every invocation of cuLaunchGrid()?

From what I can gather, CUDA loads the module from the .cubin and finds the kernel entry point. The former is probably quite slow, while the latter I assume is fast once the module is loaded. Would all of this overhead swamp a typical kernel execution initiated by cuLaunchGrid()?
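To make the pattern I'm asking about concrete, here is a sketch of the per-invocation sequence (driver API, error checking omitted; the .cubin path and kernel name are just placeholders):

```c
#include <cuda.h>

/* Everything here is repeated on EVERY kernel invocation in my current design. */
void launch_once(CUdevice dev)
{
    CUcontext  ctx;
    CUmodule   mod;
    CUfunction fn;

    cuCtxCreate(&ctx, 0, dev);                  /* new context each time        */
    cuModuleLoad(&mod, "my_kernels.cubin");     /* module load cost paid here   */
    cuModuleGetFunction(&fn, mod, "my_kernel"); /* entry-point lookup           */

    cuFuncSetBlockShape(fn, 256, 1, 1);         /* block shape for cuLaunchGrid */
    cuLaunchGrid(fn, 64, 1);                    /* the actual launch            */

    cuCtxSynchronize();
    cuModuleUnload(mod);
    cuCtxDestroy(ctx);
}
```

My question is essentially: how much of the wall-clock time of launch_once() is the cuCtxCreate()/cuModuleLoad()/cuModuleGetFunction() preamble versus the cuLaunchGrid() itself?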

I’m only looking for a rough comparison here (e.g., it obviously depends on the .cubin size, but do loading a module and looking up the function entry point take on the order of milliseconds)?

I’m afraid that performing the module load/unload and function entry lookup on each kernel invocation will hurt latency, although if the kernel itself takes several milliseconds to execute, I can probably afford a few tens of microseconds of overhead.
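In case it helps frame an answer: this is roughly how I'd plan to measure the load + lookup latency myself on my own .cubin (assumes a context is already current; timing via clock_gettime is my choice, not anything CUDA-specific):

```c
#include <cuda.h>
#include <time.h>

/* Returns the wall-clock cost in ms of loading a module and resolving
 * one kernel entry point. The cubin path and kernel name are placeholders. */
double load_latency_ms(const char *cubin, const char *kernel)
{
    CUmodule        mod;
    CUfunction      fn;
    struct timespec t0, t1;

    clock_gettime(CLOCK_MONOTONIC, &t0);
    cuModuleLoad(&mod, cubin);              /* the part I suspect is slow */
    cuModuleGetFunction(&fn, mod, kernel);  /* the part I assume is fast  */
    clock_gettime(CLOCK_MONOTONIC, &t1);

    cuModuleUnload(mod);
    return (t1.tv_sec - t0.tv_sec) * 1e3
         + (t1.tv_nsec - t0.tv_nsec) / 1e6;
}
```

(I'd average over many iterations and discard the first call, since the first load presumably also pays driver warm-up costs.)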

Thanks for any insight…