Video Codec SDK 9.0.20 Samples using cuCtxCreate

excubiteur · August 16, 2019, 10:37am

The samples in Video Codec SDK 9.0.20 use cuCtxCreate.

From CUDA Runtime API :: CUDA Toolkit Documentation

Note that the use of multiple CUcontext s per device within a single process will substantially degrade performance and is strongly discouraged. Instead, it is highly recommended that the implicit one-to-one device-to-context mapping for the process provided by the CUDA Runtime API be used.

From CUDA Driver API :: CUDA Toolkit Documentation

Note:
In most cases it is recommended to use cuDevicePrimaryCtxRetain.

Is the Video Codec SDK exempt from this recommendation?

Robert_Crovella · August 16, 2019, 2:05pm

Are you saying it is using multiple CUcontext per device within a single process? I don’t think it is.

excubiteur · August 16, 2019, 7:27pm

The official Nvidia reference entry for cuCtxCreate recommends using cuDevicePrimaryCtxRetain. It does not make clear when to prefer to use cuCtxCreate instead.
A sample for an Nvidia library uses cuCtxCreate. I asked why - in a very simple and direct way : "Is the Video Codec SDK exempt from this recommendation? ".
You respond: "Are you saying it is using multiple CUcontext per device within a single process? I don’t think it is. "
Since it is not a direct answer to a simple direct question, I will interpret it and hope for confirmation that the interpretation is right or for someone to correct it if otherwise.
My interpretation is this:

a) Since it is clear in the samples that only one instance of a context was created, cuCtxCreate can be used to no ill effect.

b) More importantly (in my case, at least), cuDevicePrimaryCtxRetain could also have been used.

c) There is nothing special about the Video Codec SDK. The recommendation to use cuDevicePrimaryCtxRetain applies here as well.

d) In larger applications where different modules are worked on by different people, always use cuDevicePrimaryCtxRetain, even when using the Video Codec SDK, to avoid creating multiple contexts.

e) The Video Codec SDK samples do not exemplify cases where you have to use cuCtxCreate instead of cuDevicePrimaryCtxRetain.

Robert_Crovella · August 16, 2019, 8:36pm

I’m referring to the first sentence from your quote:

"Note that the use of multiple CUcontext s per device within a single process will substantially degrade performance and is strongly discouraged. "

That is a correct statement. It should convey understanding. I don’t think the samples are doing that.

It’s OK to use cuCtxCreate if you create 1 context and stay within that. It’s not necessary to convert to use of the primary context. Many many CUDA driver API applications work this way, and it’s not necessary for them all to be rewritten to use the primary context nor issue “an exemption”.

So I won’t be able to answer your yes/no direct question about whether the sdk samples have an exemption. If you need an answer to that, I won’t be able to provide it.

excubiteur · August 17, 2019, 3:18am

It seems that the responses I get are only to the first part of any of my posts.

To repeat (paraphrased) the second part of my last post:

Are these assertions correct? If not, can anybody please correct them?

a) Since it is clear in the samples that only one instance of a context was created, cuCtxCreate can be used to no ill effect.

b) More importantly (in my case, at least), cuDevicePrimaryCtxRetain could also have been used.

c) There is nothing special about the Video Codec SDK (in general, not specifically about the samples). The recommendation to use cuDevicePrimaryCtxRetain applies here as well.

d) In larger applications where different modules are worked on by different people, always use cuDevicePrimaryCtxRetain, even when using the Video Codec SDK, to avoid creating multiple contexts.

e) The Video Codec SDK samples do not exemplify cases where you have to use cuCtxCreate instead of cuDevicePrimaryCtxRetain.

Topic		Replies	Views
cuDevicePrimaryCtxRetain vs cuCtxCreate GPU-Accelerated Libraries	4	1692	August 17, 2019
Multiple CUDA contexts per device in a single process CUDA Programming and Performance	2	4897	April 22, 2016
Differences between cuD3D9CtxCreate, cuD3D9CtxCreateOnDevice and cuCtxCreate? CUDA Programming and Performance	0	1056	February 14, 2013
When is it necessary to create Multiple CUcontext s per device within a single process? GPU-Accelerated Libraries	0	506	August 17, 2019
CUDA driver API - multiple threads with the same CuContext CUDA Programming and Performance cuda	7	2339	October 28, 2022
CUDA,Context and Threading CUDA Programming and Performance	6	19534	May 29, 2012
Want examples of using multiple Contexts. CUDA Programming and Performance	1	481	August 18, 2019
Problems creating green context CUDA Programming and Performance cuda , jetson	4	32	April 29, 2025
cudaGraphicsMapResources() and cuCtxCreate() incompatible? CUDA Programming and Performance	9	1862	April 7, 2018
CUDA Context-Independent Module Loading Technical Blog	2	558	February 21, 2024

Video Codec SDK 9.0.20 Samples using cuCtxCreate

Related topics