Tips to avoid laggy display during long kernels

P.M · November 4, 2011, 5:44pm

Dear CUDA users,
I amcurrently doing image processing on GPU and I have one kernel that takes something like 500 to 700 milliseconds when running on big images.
The problem is that the whole display and even the mouse cursor are getting laggy (OS=windows 7)

My idea was to split my kernel in 4 or 8 kernel launches, hoping that the driver could refresh more often (between each kernel launch).
Unfortunately it does not help at all, so what else could I try to avoid this freezing display effect?
Note: I am prepared to trade performances for smoothness!

tera · November 4, 2011, 6:15pm

Insert a [font=“'Courier New”]cudaStreamQuery(0)[/font][font=“Arial”]after each kernel to prevent them from being sent together as one batch.[/font]

P.M · November 4, 2011, 6:25pm

Ok, I will try that ASAP. So do you think (know?) if splitting my kernel will solve my issue?

pQB · November 4, 2011, 6:35pm

It’s hard to say without know at least a sketch of your code.

[*]Are you doing Graphic Interoperatibility?[*]Can you overlap memory copies and kernel calls?

Regards!

P.M · November 4, 2011, 6:39pm

No graphic interop.
I don’t overlap mem transfer and kernels, but the data processed is quite small (512Â² images) and the transfer takes only a small percentage of the whole processing.

P.M · November 7, 2011, 8:17am

cudaStreamQuery(0) solved the issue. Thanks