Cuda doing screen capture

tgraupmann · September 17, 2008, 6:31am

Is Cuda capable of doing a screen capture and saving the result to an image file?

I’m not sure if file I/O can happen via the GPU. But maybe at least the GPU might scale the screen before saving. Or possibly do the expensive compression to get a really fast capture rate.

alex_dubinsky · September 17, 2008, 6:44am

No, not even close. Look into FRAPS.

StickGuy · September 17, 2008, 8:10am

Maybe if you were crazy enough you could use a PBO with glReadPixels to load the frame buffer into CUDA and then use the CUDA accelerated Dirac codec to make a movie out of it. I’m not sure how much of the dirac processing is done on the GPU though.

tgraupmann · September 17, 2008, 3:42pm

I can already do screen capture with Win32. But I figure the blit from screen to file is a perfect operation for a multi-gpu.

tgraupmann · September 17, 2008, 3:45pm

glReadPixels, that sounds like OpenGL. I thought Cuda and OpenGL didn’t play together. Wouldn’t that be slow too. There should be a DirectX counterpart.

_Big_Mac · September 17, 2008, 3:57pm

A GPU has no direct access to external devices (in this case HDD) on its own, all disk writes have to go through CPU.

CUDA can interoperate with OpenGL, there are special functions for that. Whether it’s fast or efficient is another thing.

tgraupmann · September 18, 2008, 2:54am

CUDA should have access to the DirectX capture surface and can perform scale operations and potentially output to a new surface that is saved. That would save some CPU time.

tgraupmann · September 19, 2008, 4:45pm

Is it possible to get pixels from a Cuda surface faster than the Win32 buffer?

apaehler · September 22, 2008, 1:14pm

Below I attach some Python code (an extended translation of one of the CUDA SDK examples). This uses OpenGL (via my own python-ogl bindings - you can substitute PyOpenGL for that) and Qt with the QtOpenGL widget. The “screen capture” is a single of code image = self.grabFrameBuffer() within this code and the image saved as PNG. You can probably substitute your own favorite programming language for that (C/C++/Fortran). You may have to reproduce what grabFrameBuffer does, presumably, as others have pointed out glReadPixels. But the short summary answer is: yes, you can do screen capture when using CUDA, although not in CUDA itself. The example below using vertex buffer objects, to store the result of the CUDA computation. You may read this yourself with a cudaMemCopy, I guess - minus all color and lighting info. Note that if you use Qt, life is very simple and if you are not doing this commercially, you can use the GPL versions of Qt and PyQt - plus my CUDA bindings for Python which you can find at ftp://graviscom.com/pub/code/python-cuda/ or via http (graviscom.com/sources/rpms/python-cuda. CUDA and OpenGL work together very well, there are several examples in the CUDA SDK. Don*t know about DirectX, as I am not using that.

elif key == Qt.Key_P:

        image = self.grabFrameBuffer()

        image.save("cuda_GLimg.png","PNG")

#!/bin/env python

coding:utf-8: Â© Arno PÃ¤hler,2007-08

Qt4/PyQt4 version

from ctypes import *

import os,os.path

import time

from PyQt4 import QtCore,QtGui,QtOpenGL

from PyQt4.QtCore import Qt

from ogl.gl import *

from cuda.cuda_defs import *

from cuda.cuda_api import *

from cuda.cuda_utils import *

lib = CDLL(“./libkernelGL.so”)

kernel1 = lib.__device_stub_kernel1

kernel1.restype = None

kernel1.argtypes = [ c_void_p,c_uint,c_uint,c_float ]

kernel2 = lib.__device_stub_kernel2

kernel2.restype = None

kernel2.argtypes = [ c_void_p,c_uint,c_uint,c_float ]

Sf4 = 4*sizeof(c_float)

kernel = kernel2

class CudaGLWidget(QtOpenGL.QGLWidget):

def __init__(self,parent = None,name = None):

    QtOpenGL.QGLWidget.__init__(self,parent,name)

    self.setWindowTitle("Cuda GL Demo")

self.device = cuda_CUDA()

self.initializeGL gets called automatically

    # and implicitly creates the OpenGL context

    self.reset()

    self.setGeometry(QtCore.QRect(0,0.,self.width,self.height))

self.t0 = time.time()

    self.frames = 0

    self.startTimer(0)

def reset(self):

    self.mouse_buttons = -1

    self.last_x = 0.

    self.last_y = 0.

    self.rot_x = 0.

    self.rot_y = 0.

    self.trn_z = -3.

    self.anim = 0.

    self.scale = 1.

    self.play = True

self.width = 512

    self.height = 512

self.mesh_w = 256

    self.mesh_h = 256

    self.m_size = self.mesh_w*self.mesh_h

def runCuda(self):

    vptr = c_void_p()

    cudaGLMapBufferObject(byref(vptr),self.vbo)

block = dim3(16,16,1)

    grid = dim3(self.mesh_w/block.x,self.mesh_h/block.y,1)

    cudaConfigureCall(grid,block,0,0)

    kernel(vptr,self.mesh_w,self.mesh_h,self.anim)

cudaGLUnmapBufferObject(self.vbo)

def initializeGL(self):

    glDisable(GL_DEPTH_TEST)

    self.createVBO()

def paintGL(self):

    self.runCuda()

glClear(GL_COLOR_BUFFER_BIT|GL_DEPTH_BUFFER_BIT)

glMatrixMode(GL_MODELVIEW)

    glLoadIdentity()

    glTranslatef(0.,0.,self.trn_z)

    glRotatef(self.rot_x,1.,0.,0.)

    glRotatef(self.rot_y,0.,1.,0.)

    s = self.scale

    glScalef(s,s,s)

glBindBuffer(GL_ARRAY_BUFFER,self.vbo)

    glVertexPointer(4,GL_FLOAT,0,0)

glEnableClientState(GL_VERTEX_ARRAY)

    glColor3f(1.,1.,.75)

    glDrawArrays(GL_POINTS,0,self.m_size)

glDisableClientState(GL_VERTEX_ARRAY)

if self.play:

        self.anim += 0.01

def resizeGL(self,width,height):

    self.width = width

    self.height = height

    w = float(width)/float(height)

    h = 1.

glViewport(0,0,width,height)

    glMatrixMode(GL_PROJECTION)

    glLoadIdentity()

    glFrustum(-w,w,-h,h,2.,10.)

glMatrixMode(GL_MODELVIEW)

    glLoadIdentity()

    glTranslatef(0.,0.,-3.)

def createVBO(self):

    self.vbo = vbo = GLuint()

    glGenBuffers(1,byref(vbo))

    glBindBuffer(GL_ARRAY_BUFFER,vbo)

size = self.m_size*Sf4

    glBufferData(GL_ARRAY_BUFFER,size,0,GL_DYNAMIC_DRAW)

    b_ptr = glMapBuffer(GL_ARRAY_BUFFER,GL_WRITE_ONLY)

    print vbo,size,b_ptr

    glUnmapBuffer(GL_ARRAY_BUFFER)

    glBindBuffer(GL_ARRAY_BUFFER,0)

    cudaGLRegisterBufferObject(vbo)

def deleteVBO(self):

    vbo = self.vbo

    glBindBuffer(1,vbo)

    glDeleteBuffers(1,vbo)

cudaGLUnregisterBufferObject(vbo)

    self.vbo = None

def keyPressEvent(self,ev):

    key = ev.key()

    if key == Qt.Key_Escape \

    or key == Qt.Key_Q:

        self.deleteVBO()

        cudaThreadExit()

        exit()

    elif key == Qt.Key_R:

        self.reset()

    elif key == Qt.Key_S:

        self.play = not self.play

    elif key == Qt.Key_P:

        image = self.grabFrameBuffer()

        image.save("cuda_GLimg.png","PNG")

def mousePressEvent(self,ev):

    button = ev.button()

    if button == Qt.LeftButton:

        self.MouseButton = 0

    elif button == Qt.RightButton:

        self.MouseButton = 2

    else:

        self.MouseButton = 1

    self.last_x = ev.x()

    self.last_y = ev.y()

def mouseReleaseEvent(self,ev):

    self.MouseButton = -1

def mouseMoveEvent(self,ev):

    x,y = ev.x(),ev.y()

    dx = x-self.last_x

    dy = y-self.last_y

    mouse = self.MouseButton

    if mouse == 0:

        self.rot_x += .2*dy

        self.rot_y += .2*dx

    elif mouse == 2:

        self.trn_z += .01*dy

    self.last_x = x

    self.last_y = y

def wheelEvent(self,ev):

    delta = ev.delta() # usually +/- 120

    scale = self.scale

    if delta != 0:

        scale += 6./float(delta)

    self.scale = min(2.,max(.2,scale))

def timerEvent(self,ev):

    t = time.time()

    self.frames +=  1

    if (t-self.t0) >=  5.0:

        seconds = t-self.t0         

        fps = self.frames/seconds

        print "%d frames in %3.1f seconds = %6.1f fps" % (

            self.frames,seconds,fps);

        self.frames = 0

        self.t0 = t

    self.updateGL() # needed

##############################################################################

if name == ‘main’:

from sys import argv

app = QtGui.QApplication(argv)

widget = CudaGLWidget()

widget.show()

app.exec_()

tgraupmann · September 26, 2008, 7:28pm

Thanks apaehler now I know it is possible.

Forest · June 7, 2009, 9:07am

Hi tgraupmann:

Do you have found a solution about capture screen in cuda?

Could you share some idea with me.

I am also doing this work now, and I have not found a method after near one month.

Appreciate for your help.

Topic		Replies	Views
Processing Windows GDI drawing with CUDA Finding the way to do so at high speed CUDA Programming and Performance	3	6070	September 11, 2009
Any example on real time video processing CUDA Programming and Performance	12	3897	January 6, 2012
is it possible to do real-time processing of what is being sent to my monitor? CUDA Programming and Performance	10	6978	January 12, 2012
cudaGLMapBufferObject (and unmap) performance These calls take way too long CUDA Programming and Performance	47	76295	February 14, 2010
cudaGraphicsGLRegisterBuffer and unspecified driver error CUDA Programming and Performance	5	4627	September 22, 2011
CUDA and OpenGL - Beginner question CUDA Programming and Performance	15	4468	March 6, 2015
newbie question possible to use cuda for 3d visualization? CUDA Programming and Performance	9	6291	March 9, 2009
OpenGL & CUDA CUDA Programming and Performance	12	9848	January 16, 2009
display a buffer openGL/cuda question CUDA Programming and Performance	11	8176	May 13, 2008
Pass openGL data to CUDA. Question about speed. CUDA Programming and Performance	4	1876	August 22, 2016

Cuda doing screen capture

coding:utf-8: Â© Arno PÃ¤hler,2007-08

Qt4/PyQt4 version

self.initializeGL gets called automatically

Related topics