Lot’s of articles explain that during GPU operation, CPU load on one thread is high (e.g. 100%) due spin-wait poll. The following command should reduce this:
This works fine, but I use
clock() inside my host code to calculate performance once kernel is finished.
clock() however returns erratic (way too low) ticks.
Why is this?
I don’t see why reducing poll frequency CPU/GPU interferes with the ticks of the CPU.