Texture references have to be declared globally in the .cu file.
With CUDA 4.0 I am now able to lauch the same kernel on different GPUs just using cudaSetDevice()without the need to have a multi-threaded (CPU) application.
I have found some hints that for a multi-threaded application with one GPU per thread this works. But I want a single threaded application with multiple GPUs.
How are texture references correctly handled in this context.
Is it sufficient to just bind/unbind the same texture reference for each GPU in turn?
And what about constant memory, also declared globally ? Can it be used at all ?
Are there any experiences with this ?
Any hint to some documentation would be highly appreciated ?!