Hi people, I want to know if, in a global function runs by kernel<<<dimBlocks,dimThreads>>>(…);, the access in memory occurs in parallel or in sequential (maybe for the cause of bus???). Example:
__global__ void SumVect(...){
...
VettRis[TID]=Vett1[TID]+Vett2[TID];
...
}
The threads works in parallel or in sequential because they must accede in the same global memory?