cudaGraphicsMapResources each frame or just once when cuda-opengl interop ? which better?

opengpu · December 5, 2023, 2:30am

both work, and saw demos diff:
some call cudaGraphicsMapResources after cudaGraphicsGLRegisterImage which only 1 time before the render-loop;
some call cudaGraphicsMapResources each frame, and cudaCreateSurfaceObject, cudaDestroySurfaceObject, cudaGraphicsUnmapResources all are called each frame.
Thanks!

opengpu · December 6, 2023, 2:35am

ppt from nvidia years ago do Map and Unmap EveryFrame, but it seems that not Map&Unmap everyFrame also works, and as Map&Unmap is heavy SO is it also OK to do Map&Unmap NOT everyFrame but with Register&Unregister in the init&release?
Thanks

Robert_Crovella · December 6, 2023, 2:43am

AFAIK it is undefined behavior to not do map and unmap every frame. That is:

when you want to access the data from OpenGL side, you must unmap it first (if it was previously mapped).
when you want to access the data from the CUDA side, you must map it first (if it has not been previously mapped, since the last unmap operation, or creation operation)
there is no guarantee that the address returned by the map operation will be the same, from one frame to the next.

I won’t be able to point you at documentation or justification or recent training material for this. You can find GTC material that covers guides for CUDA/Graphics interop. Do as you wish, of course.

Yes, I acknowledge in many cases it appears to work if you don’t follow this guide.

opengpu · December 6, 2023, 2:50am

thanks! so map everyFrame is the right way.
and is Map & Unmap Async? need to call cudaStreamSynchronize(0) after them?

moreover, need to call cudaStreamSynchronize(0) or cudaDeviceSynchronize after the Kernel?

Robert_Crovella · December 6, 2023, 2:59am

as far as I know, map and unmap are synchronous.

I don’t understand the other question.

If you are asking can you launch a kernel followed by an unmap, without an intervening sync call, I believe the answer is yes. That is safe.

And if that is what you were asking about here, I can say that I for one had no idea you were talking there about in the context of CUDA graphics interop where you are launching a kernel followed by an unmap call.

There is such a concept as multi-GPU interop which is more arcane. None of my response apply to that case.

opengpu · December 6, 2023, 3:05am

Mixing Graphics and Compute with Multiple GPUs - Part 1 (gputechconf.com)
This is enough if:
— You can use the map/unmap hints
— map/unmap is CPU asynchronous
— You are afraid of multiple threads
— You developed the whole application

Robert_Crovella · December 6, 2023, 3:08am

sure, map/unmap may be cpu synchronous or asynchronous. that isn’t the way I interpreted the question. The point is, with respect to GPU activity, it is synchronous. The unmap operation will not begin until the previous kernel call is complete. And if that is the case, then an intervening sync would be unnecessary and superfluous. And again, I am mostly speaking from the perspective of single GPU interop. If you are now asking about multi-GPU interop, please disregard all of my comments.

And if you are concerned about something else, then you have given no indication from what I can see, of what your actual concern is.

Good luck!

system · December 20, 2023, 3:08am

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Inefficient CUDA and OpenGL Interop CUDA Programming and Performance	4	2272	December 5, 2012
When to call cudaMapResources? CUDA Programming and Performance	4	3155	February 28, 2010
OpenGL interop performance CUDA Programming and Performance	2	45	December 11, 2024
Mapped address consistency in Interop-ed buffer/array CUDA Programming and Performance	9	518	June 1, 2022
OpenGL & CUDA interop with surfaces slow... CUDA Programming and Performance	2	956	July 6, 2018
OpenGL Interoperability - Multi CPU-Threading CUDA Programming and Performance	1	2042	March 15, 2013
OpenGL interop performance issues again... (or rather, still...) CUDA Programming and Performance	7	2455	April 16, 2009
Expected CUDA-OpenGL interop behaviour in case of wgl_nv_gpu_affinity context OpenGL	0	852	April 27, 2022
cudaGraphicsUnmapResources and concurrent copy and execute CUDA Programming and Performance	2	7382	September 7, 2011
cudaGraphics Map/Unmap of D3D11 resources is slow CUDA Programming and Performance	0	120	June 6, 2024

cudaGraphicsMapResources each frame or just once when cuda-opengl interop ? which better?

Related topics