Performance issues with CUDA 1.1 & 169.09 drivers: performance degradation on OGL interop

pedro.leite · December 7, 2007, 5:05pm

Hello,

I have changed the cudaProcess kernel code just to see the overall fps on updating one PBO with input from another. It looks like this:

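// Copies one int per thread from the input buffer to the output buffer.
// imgh, tilew, r, threshold and highlight are unused in this stripped-down version.
__global__ void cudaProcess(int* g_data, int* g_odata, int imgw, int imgh, int tilew, int r, float threshold, float highlight) {
    int tx = threadIdx.x;
    int ty = threadIdx.y;
    int bw = blockDim.x;
    int bh = blockDim.y;

    // Global pixel coordinates of this thread.
    int x = blockIdx.x*bw + tx;
    int y = blockIdx.y*bh + ty;

    // Straight copy; there is no bounds check, so the launch grid must cover the image exactly.
    g_odata[y*imgw+x] = g_data[y*imgw+x];
}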

With CUDA 1.0 and 162.01 drivers, the sample runs at 400 fps on an 8800 GTX, without GPU interoperability, of course.

With CUDA 1.1 and 169.09 drivers, the sample runs at 70 fps on the same 8800 GTX, except that it is only used for CUDA computations (a 7900 GTX is used for display).

Also, even with everything (display and computation) running on the same 8800 GTX, the fps is about 75!!!

Why is this PBO-to-PBO copy with the new drivers causing such performance degradation?
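One way to narrow this down is to time the same copy on ordinary device buffers, with no OpenGL involved. The sketch below is only an illustration under stated assumptions: the 512x512 image size, the 16x16 block size, and the names copyKernel and timePlainCopy are hypothetical and do not come from the sample, and the bounds guard is an addition. For reference, 400 fps corresponds to 2.5 ms per frame and 75 fps to about 13.3 ms per frame. If the plain copy takes far less than that, the extra time is probably being spent in the interop map/unmap path rather than in the kernel.

```cuda
#include <cstdio>
#include <vector>
#include <cuda_runtime.h>

// Same element-wise copy as cudaProcess, reduced to the parameters it uses.
// The bounds guard is an addition so that any image size is safe.
__global__ void copyKernel(const int* g_data, int* g_odata, int imgw, int imgh) {
    int x = blockIdx.x * blockDim.x + threadIdx.x;
    int y = blockIdx.y * blockDim.y + threadIdx.y;
    if (x < imgw && y < imgh) {
        g_odata[y * imgw + x] = g_data[y * imgw + x];
    }
}

// Runs `iterations` copies on plain device buffers (no PBOs), stores the
// average time per copy in *msPerCopy, and returns true if the last copy
// reproduced the input exactly.
bool timePlainCopy(int imgw, int imgh, int iterations, float* msPerCopy) {
    size_t count = (size_t)imgw * imgh;
    size_t bytes = count * sizeof(int);
    std::vector<int> h_in(count), h_out(count, 0);
    for (size_t i = 0; i < count; ++i) h_in[i] = (int)i;

    int* d_in = NULL;
    int* d_out = NULL;
    if (cudaMalloc(&d_in, bytes) != cudaSuccess) return false;
    if (cudaMalloc(&d_out, bytes) != cudaSuccess) { cudaFree(d_in); return false; }
    cudaMemcpy(d_in, h_in.data(), bytes, cudaMemcpyHostToDevice);

    dim3 block(16, 16);  // assumed block size
    dim3 grid((imgw + block.x - 1) / block.x, (imgh + block.y - 1) / block.y);

    // Events measure GPU time for the whole batch of launches.
    cudaEvent_t start, stop;
    cudaEventCreate(&start);
    cudaEventCreate(&stop);
    cudaEventRecord(start, 0);
    for (int i = 0; i < iterations; ++i) {
        copyKernel<<<grid, block>>>(d_in, d_out, imgw, imgh);
    }
    cudaEventRecord(stop, 0);
    cudaEventSynchronize(stop);

    float totalMs = 0.0f;
    cudaEventElapsedTime(&totalMs, start, stop);
    *msPerCopy = totalMs / iterations;

    bool ok = (cudaGetLastError() == cudaSuccess);
    ok = ok && (cudaMemcpy(h_out.data(), d_out, bytes, cudaMemcpyDeviceToHost) == cudaSuccess);
    ok = ok && (h_in == h_out);

    cudaEventDestroy(start);
    cudaEventDestroy(stop);
    cudaFree(d_in);
    cudaFree(d_out);
    return ok;
}

int main() {
    float ms = 0.0f;
    bool ok = timePlainCopy(512, 512, 100, &ms);  // 512x512 is an assumed size
    printf("copy %s, %.4f ms per copy\n", ok ? "verified" : "FAILED", ms);
    return ok ? 0 : 1;
}
```

This only gives a baseline for the kernel itself. It does not measure the cost of registering, mapping, or unmapping the PBOs, which is the part that differs between the two driver versions.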

MisterAnderson42 · December 8, 2007, 2:20am

Read the release notes; they explain why:
“On systems with multiple GPUs installed or systems with multiple monitors connected to a single GPU, OpenGL interoperability always copies shared buffers through host memory.”

pedro.leite · December 8, 2007, 3:38pm

OK, two of the three points are ‘solved’, but what about the third?

With CUDA 1.0 and 162.01 drivers, I got about 400 fps on a single copy, and now only 75!

This is still a problem for me…

MisterAnderson42 · December 8, 2007, 5:08pm

By “everything (display and computation) running on the same 8800 GTX” do you mean a single machine with a single monitor and a single video card? Because if you still have 2 cards or 2 monitors attached to a single card, then the release note still applies.

pedro.leite · December 10, 2007, 5:13pm

OK, this completely “solved” my problem. Now I guess I’m gonna wait for a 1.2 release.
Thanks.
