I’m currently adding pthreads to my existing CUDA code, and I’m a little confused about one thing. Now that CUDA 3.0 is out, can any card run concurrent kernels, or is that only for the Fermi architecture?
I’m running on a GeForce GT 220. I want to run 6 host threads, each of which makes a kernel call. The code runs up to a certain point and then crashes.