I’m experiencing issues with this GPU card when I try to use Streams on my code. I’ve checked the three requisites to stream (deviceOverlap OK, Kernel execution and Data Transfers to be overlapped occurring in different-non-default streams and host memory involved as pinned memory) but I can see on nVidia Profiler that overlapping between data transfers and kernels is unsuccessful.
At last I’ve tried to run this basic example just to make sure that it’s not my fault…
But it still does not work. Memcpy’s and Kernels do not overlap.
Does anyone knows if there is some kind of problems with this GPU to use Streams?