I’m using cudaMemcpy() to copy data back from the device to the host.
This call ruins my runtime.
My simplified code:
Sorry the system won’t let me insert the code. I’ll try again later.
As you can see, the runtime jumps to 3000, which is unacceptable for my original (time-critical) program.
Is there any other, faster way to get the data back from the device?