Since a cudaBindTexture call is not specific to a stream, I am wondering if both bind and unbind calls have implicit device-wide synchronization. The Cuda C manual does not specify this in the implicit synchronization section when discussing streams, so I am a bit confused. For instance, it would not seem you could bind a texture, execute a kernel in one stream, then bind that same texture again (to a different piece of memory) and execute another kernel in another stream without at least some kind of stream synchronization (or device wide synchronization). There must be some documentation on this. Can someone point me in the right direction? Thank you in advance for your help.
(not trying to cross-post - originally put in the wrong bucket)