Fastest solution to present decoded frames with OpenGL with NVDEC

diederick · November 11, 2022, 11:20am

What would be the fastest solution to get a decoded picture (from NVDEC) into a texture that can be rendered with OpenGL?

The FramePresenterGL.h example from the video-skd-samples, uses a PBO for the OpenGL/CUDA interop. Then it copies the CUDA device buffer into the PBO (= first copy), then it uses glTexSubImage2D() to copy the data from the PBO into the texture (= second copy).

Is this the best we can do? Or can we maybe skip one copy? Maybe there are platform specific solutions?

ichlubna · November 15, 2022, 7:31am

I am interested in this too. Zero-copy access to the decoded surface should be physically possible since the resource is in the global memory right? Of course the user would have to ensure that the frames in DPB are not replaced by newly decoded ones. We would need something similar to cuGraphicsResourceGetMappedPointer but to work the other way around. To make the CUgraphicsResource from CUdeviceptr.

I would also love to know if the mapped decoded frame can be directly used in kernels (Simple casting can be used like here?) or as the input for FRUC library (possible duplicate).

For reference…I’ve been experimenting with FFmpeg and GPU accelerated decoding with VDPAU where I decoded the frame but had to convert it with VDP Video Mixer and pass it to GL using VDAPU Interop Extension. I believe that at least one copy happened in the mixer.

A zero-copy access should be possible in Vulkan.

Also good article here.

Topic		Replies	Views
OpenGL in 3.0 CUDA Programming and Performance	3	5190	March 26, 2010
The fastest way to decoded video frame to opengl texture? Jetson Xavier NX opengl	2	2175	February 9, 2022
Copy OpenCL image or buffer object to CUDA surface CUDA Programming and Performance	3	1373	March 29, 2017
Using NVDEC to transfer non-image data Video Processing & Optical Flow	4	1096	November 1, 2017
Encoding OpenGL textures live on Windows Video Processing & Optical Flow	0	880	April 3, 2019
CUDA for real-time video processing? CUDA Programming and Performance	1	4239	April 24, 2007
nvJPEG Encode directly from an OpenGL Texture? CUDA Programming and Performance	0	448	November 19, 2020
help with video cuda and open gl CUDA Programming and Performance	0	2389	February 9, 2010
Using GPU video decoding in openCL ? CUDA Programming and Performance	0	8237	February 17, 2011
NVCUVENC Encoding from OpenGL FBO CUDA Programming and Performance	1	1194	February 17, 2013

Fastest solution to present decoded frames with OpenGL with NVDEC

Related topics