Here are the pages I want to show you:
If you download the file and glance at pages 8 to 11, the guide talks about thread batching and the memory model. Page 9 covers thread batching and shows how the threads are actually packed into a block. This is the part I don’t understand. What confuses me even more is that, in the figure about the memory model, only two threads are shown between the registers and the local memory. My questions are:
From figure 2-1, how exactly are the threads packed into a single block? It seems to me that fifteen threads are mapped into a single block.
In figure 2-2, why are there only two blocks in the grid, when figure 2-1 shows six blocks in a grid? Am I possibly mixing something up?
I am trying to form a big picture of how the threads and blocks are arranged in memory. Let me know.