Yet another post on my latency deep dive.
Our application produces an output image using CUDA from several synchronized camera feeds. The resulting image is in GPU memory.
I would like to display the image on-screen as fast as possible.
I currently use OpenGL/GLUT in a C/C++/CUDA program: I copy the image to a texture, render that texture on a screen-filling quad (GL_QUADS), and then call glutSwapBuffers(). I’m the first to admit that my OpenGL chops are a bit out of date.
Is there a faster way to render to the screen on the Jetson AGX Xavier? That is, can I somehow write to the screen’s framebuffer directly?
I’m not necessarily bound to OpenGL.
Thanks for the info. I’m already using the CUDA/OpenGL interop to copy from my CUDA buffers to textures directly; our data never touches CPU memory.
The DRM looks promising, though. Should I be looking at samples/common/classes/NvDrmRenderer.cpp and samples/08_video_dec_drm/video_dec_drm.cpp under the jetson_multimedia_api examples?
I’m not personally familiar with DRM, but it appears the header is found at /usr/src/jetson_multimedia_api/include/NvDrmRenderer.h and the 08_video_dec_drm sample uses it.
The Jetson Multimedia API samples also show the use of EGL for display. I’m not sure how its performance compares to OpenGL, but it may also be worth looking into.
Thanks, I’m poking through your texture class and there’s more there for me to learn than I thought.
Your examples are always very helpful, well written, and appreciated!