Well I am a little bit confused when it comes to the following facts;
1- A Streaming Multiprocessor has 8 scalar processors. so how can a warp having 32 threads run simultaneously? Does’nt that means 4 threads run simultaneously on a single scalar processor.
2- What is 768 threads then? It is mentioned at several places that each SM can accommodate up to 768 threads. But then what is warp?
3-My question is if I ask you what is the Maximum Number of threads that can run simultaneous on a single SM, what will be your answer 32 (one warp ) OR 768?