first question: From my testing, i can see that when GPU deals with a kernel, it does’t support data transfer from CPU at the same time.
why cuda API (1.1) does’t support memory copy parallel with GPU computing?
second question: GPU does’t support data transfer from cpu to GPU parallell with data transfer from GPU to cpu, but i think DMA should support bidirectional data transmission.
does it hardware not support or API not support?
any one can give me explanation?
thank you in advance!
G80 is 1.0 hardware, so it does not support async memory copy operations. The fact that it’s reported as supported is a bug. However, for 1.1 (and later) hardwares (including G8X other than G80 and G92) it’s supported.
according to my testing, it indeed does’t support overlapping. the same code run on GTX280 (with capability 1.3), its result indicates the parallel very good.