I’m trying to figure out the maximum number of threads that could be being executed at any given instant in time. A pdf from NVIDIA’s webinar says,
1 thread block = 32 threads = Warp
1 Warp is executed physically on a multiprocessor.
So, at any given instant in time,
N_threads_total = N_multiprocs * N_threads_per_warp
To take a specific example, on the GTX 280 there are 30 multiprocessors, and therefore 30*32 = 960 threads being executed any any given time. Does this look right to you guys?