My project is researching GPU hardware, so I want to understand how CUDA cores work. (I have read many papers, but I am still confused about this.)
My GPU is a GTX 960M, whose chip is the GM107. It has 5 SMs, and each SM has 128 CUDA cores. It uses the Maxwell architecture: in one SM, the 128 CUDA cores are divided into 4 groups, and each group has its own warp scheduler.
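To make the numbers I am working from concrete, here is a small sketch of the arithmetic. It is plain Python, not CUDA, and it does not query the hardware. The function and constant names are my own, the figures are the ones I listed above, and the warp size of 32 is the standard CUDA value.

```python
# Figures from my description of the GM107; all names are only illustrative.
WARP_SIZE = 32          # threads per warp in CUDA
SM_COUNT = 5            # SMs on the GM107 in the GTX 960M
CORES_PER_SM = 128      # CUDA cores per Maxwell SM
SCHEDULERS_PER_SM = 4   # one warp scheduler per group of cores


def cores_per_scheduler(cores_per_sm, schedulers_per_sm):
    """Number of CUDA cores in each warp scheduler's group."""
    return cores_per_sm // schedulers_per_sm


def total_cores(sm_count, cores_per_sm):
    """Number of CUDA cores on the whole chip."""
    return sm_count * cores_per_sm


if __name__ == "__main__":
    group = cores_per_scheduler(CORES_PER_SM, SCHEDULERS_PER_SM)
    print("cores per scheduler group:", group)
    print("same as warp size:", group == WARP_SIZE)
    print("total CUDA cores:", total_cores(SM_COUNT, CORES_PER_SM))
```

Each group works out to the same number of cores as there are threads in a warp, which is why I guess that one group handles one warp at a time. I do not know whether the hardware actually behaves this way.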
My questions are:
1. Does one CUDA core execute one thread, so that 32 CUDA cores execute 1 warp?
2. Can one warp scheduler execute only one warp at a time, so that on my GPU one SM can execute 4 warps at a time?
3. How do the CUDA cores work in the Maxwell architecture?
I am very confused. Thank you very much.