Is there a way to use cudaMalloc in two different CPU threads?
I’m trying to allocate memory on 2 GPU cards from functions that run in parallel, where the first line of each function is a cudaSetDevice call that selects which card that thread will use. When I compile my code, the output says that the pthreads library isn’t available to nvcc. Would streams be useful in this case?
Thanks for any answers.