So I decided I wanted to try CUDA programming on my GTX950. I’ve read that the profiler is a really useful tool so I wrote a little program to try it out and I got some really weird results. All the program does is transfers a matrix from the host to the device, inverts it, and transfers it back 1000 times. The puzzling bit is that about half of the transfers move at 10GB/s and the other have average around 2GB/s. This is consistent regardless of the Gen I specify in my bios. Has anyone else experienced inconsistent memory transfers?
Related topics
Topic | Replies | Views | Activity | |
---|---|---|---|---|
Handful of Slow Memory Transfers | 7 | 813 | June 17, 2016 | |
Inconsistant Memory Copy Speed | 14 | 8062 | May 20, 2009 | |
CUDA Profiler [memcopy] weird result | 7 | 7257 | November 8, 2007 | |
Visual Profiler displays erroneous output with multiple GPUs Profiler problem on multi-gpu scaling b | 0 | 791 | May 9, 2012 | |
Negative Global memory excess load Compute Visual Profiler | 0 | 10619 | May 23, 2011 | |
[SOLVED] cudaMemcpy down to 100Mbyte/s | 3 | 484 | September 4, 2019 | |
Profiler timing measurements wrong? | 0 | 1892 | June 3, 2015 | |
"Unified Memory Profiling is not supported ..." warning 3348 | 15 | 5756 | September 20, 2018 | |
Very slow memory transfer problem Simple program executes very slowly, bandwidth test shows normal r | 2 | 907 | February 7, 2011 | |
streams strange behaviour with profiler | 0 | 527 | November 25, 2014 |