Hi. I have a new question about histograms. I run my program on a 1024*768 picture (lvl of grey).
Each thread (768) analyse 768 pixels of the picture and create its histogram in a specifik place in the global.
In one case, we stock the data in texture memory, and in the other, we stock it on global memory.
I run the program on a 8800 GTS (320 Mb) and on a 8800(GTX) and we obtain these results :
This time is in ms.
I notice that for the GTS, I have a change after 96 thread (the time becomes slower). I thought I will notice the same thing on GTX after 128 threads, but nothing.
How can we explain this difference ? (I hope detailled informations).
And I don’t understand why the difference is so important with 2 threads !