I know this has been asked and answered a million times, but from all the posts I have read, I am still not sure what is right and what is wrong. If someone could help me, it would be great.
I have a Tesla C1060 with 30 multiprocessors, each with 8 streaming processors, for a total of 30 * 8 = 240 CUDA cores. My question is: how many threads can I have running concurrently?
From all I have read, I understand that in every clock cycle I will have 30 * 8 = 240 threads running concurrently, and that every 4 clock cycles 240 * 4 = 960 threads will have completed the execution of 1 instruction.
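To make my counting explicit, here is a small Python sketch of the arithmetic. It is only a back-of-the-envelope model, not a measurement. The function names are made up for illustration, and the warp size of 32 and the 4 cycles per warp instruction are my assumptions about how this card issues work.

```python
# Back-of-the-envelope thread counting for the Tesla C1060.
# All constants below are assumptions taken from what I have read,
# not values queried from the device.
NUM_SMS = 30                      # multiprocessors on the card
CORES_PER_SM = 8                  # streaming processors per multiprocessor
WARP_SIZE = 32                    # threads per warp (assumed)
CYCLES_PER_WARP_INSTRUCTION = 4   # assumed: 32 threads issued over 8 cores


def threads_per_cycle(num_sms=NUM_SMS, cores_per_sm=CORES_PER_SM):
    """Threads executing in a single clock cycle: one per core."""
    return num_sms * cores_per_sm


def threads_per_warp_instruction(cycles=CYCLES_PER_WARP_INSTRUCTION):
    """Threads that have completed one instruction after `cycles` clock cycles."""
    return threads_per_cycle() * cycles


if __name__ == "__main__":
    print("threads per clock cycle:", threads_per_cycle())
    print("threads per 4 clock cycles:", threads_per_warp_instruction())
    # Cross-check: one warp per multiprocessor should give the same count.
    print("one warp per multiprocessor:", NUM_SMS * WARP_SIZE)
```

If those assumptions hold, the second and third numbers should agree, since one warp of 32 threads per multiprocessor is the same as 8 cores working for 4 cycles.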
Is this correct? Or have I totally misunderstood?
Thank you in advance for any help.