How to Directly display on GPU without transfering data back to CPU

MasterKitten · August 22, 2011, 7:31pm

Hello Im using 9800GT and 9500GT as my GPUs and using windows as my OS. Starting from cuda 4.0 I read I can transfer data from
gpu to gpu so I was wondering if I could do math on one gpu and display on another. Problem is Im using CPUbitmap function provided by
NVIDIA and it seems that this certain function is the bottle neck of my program. Is there a way to directly display without transferring data back to cpu? Or is there better (faster) way of displaying on monitor than CPUbitmap? Thank you

Nighthawk13 · August 24, 2011, 9:09am

It should be possible to:
a.) Direct-copy data from the compute GPU to the display GPU (as cuda linear memory)
b.) Copy linear data into a OpenGL texture mapped as cuda array
c.) render the OpenGL texture
(Never tried it myself)

Or, easier to implement:
a.) Write data from compute GPU to host memory (mapped host memory or memcpy)
b.) upload from host memory to display GPU as texture
c.) render texture

See the simpleGL SDK example for the OpenGL+Cuda part.

alrikai · August 24, 2011, 6:05pm

There was an Nvidia presentation about CUDA <–> OpenGL interoperability a while ago; you can see the slides here. Not sure if this is what you’re looking for, but it might be of interest

MasterKitten · September 6, 2011, 2:25pm

My program is about 2D image. Im not sure if using OpenGL is good for it since it is optimized for 3D users right?? Does OpenGL work well with
2D images as well? Or is there better options?

alrikai · September 6, 2011, 4:21pm

OpenGL works fine with 2D images, although it can be a bit unwieldy. In all, when coupled with CUDA, either OpenGL or DirectX (if you’re only going to be on windows) are the way to go. Taking advantage of the cuda ↔ OpenGL/DirectX interoperability allows you to display things from your device memory w/o copying back to the host memory.

jack · September 6, 2011, 8:49pm

You can’t copy data directly from one GPU to another (i.e., without going through the host) unless you’re using two Fermi-based cards.

MasterKitten, have you tried just doing the math + display on a single card, and using OpenGL or DirectX interop to directly display the results of your calculations? If your display card has enough memory for what you need to do, it might be faster than offloading the math to the other card, precisely because of the bottleneck of going through the host (and all the synchronization that needs to take place between the two cards and the host).

Topic		Replies	Views
Cuda with image processing CUDA Programming and Performance	7	11694	January 17, 2012
Is there any way to use GPU memory directly ?! CUDA Programming and Performance	1	1676	July 31, 2008
communication b/w gpus and graphics display CUDA Programming and Performance	1	4715	May 7, 2008
Displaying to screen from CUDA CUDA Programming and Performance	7	5922	February 28, 2009
Display with CUDA Options to display with CUDA CUDA Programming and Performance	6	10656	February 4, 2008
directly render from Device memory CUDA Programming and Performance	1	2837	January 18, 2010
Load data on GPU, work on GPU and receive data from GPU. OPENGL or CUDA? CUDA Programming and Performance	3	1252	November 13, 2009
Displaying CUDA 2D data The process of moving CUDA data onto a texture or similar CUDA Programming and Performance	0	3446	June 3, 2010
Howto efficiently copy texdata from OpenGL to CUDA CUDA Programming and Performance	4	2827	March 5, 2008
CUDA and OpenGL data transfer CUDA Programming and Performance	9	21277	October 6, 2007

How to Directly display on GPU without transfering data back to CPU

Related topics