I figured out that the setMatrix and getMatrix functions of cublas are a bottleneck in my app. So i wonder whether they’re using DMA or not.Unfortunately i hardly can find some information about that - it isn’t mentioned at all in the cublas manual. Perhaps some of you guys knows something about it?!?
thanks in advance,