I have code that uses several texture references. It will be executed on up to 4 GPUs, but with a time offset between them, so the GPUs run asynchronously to each other.
My problem is that every host thread uses the same texture references, but texture references have to be declared globally and cannot be arrays.
I saw that there are functions for creating a new context. According to the programming manual, a context is analogous to a CPU process, with its own distinct 32-bit address space.
But how does this help? I output the address of my
but the address is always the same in all of the threads.
Am I doing something wrong? Or have I misunderstood how a context is supposed to be used?
Additionally: can the CUDA FFT library (CUFFT) be used from multiple threads?
Currently I’m using a rather complicated workaround that forks the main application and synchronizes the processes via Linux shared memory.
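
To make the workaround concrete, here is a minimal sketch of the fork-plus-shared-memory pattern. It is not the actual application code. It assumes anonymous shared memory via mmap (the real code may use a different Linux shared-memory mechanism), and run_workers and do_gpu_work are hypothetical names. The GPU work is replaced by a placeholder so the sketch compiles without CUDA; a real worker would select its device there, and because each forked process has its own copy of all globals, each one would also get its own texture references.

```c
#define _DEFAULT_SOURCE            /* exposes MAP_ANONYMOUS on glibc */
#include <assert.h>
#include <stdlib.h>
#include <string.h>
#include <sys/mman.h>
#include <sys/types.h>
#include <sys/wait.h>
#include <unistd.h>

/* Hypothetical placeholder for the per-GPU work. A real worker would
   select its device here (e.g. with cudaSetDevice) and launch kernels. */
static int do_gpu_work(int gpu)
{
    return 100 + gpu;
}

/* Fork one worker process per GPU and collect one int per worker
   through a shared mapping. Returns 0 on success, -1 on failure. */
int run_workers(int n, int *results)
{
    assert(n > 0 && results != NULL);

    size_t bytes = (size_t)n * sizeof(int);
    int *shared = mmap(NULL, bytes, PROT_READ | PROT_WRITE,
                       MAP_SHARED | MAP_ANONYMOUS, -1, 0);
    if (shared == MAP_FAILED)
        return -1;

    pid_t *pids = malloc((size_t)n * sizeof(pid_t));
    if (pids == NULL) {
        munmap(shared, bytes);
        return -1;
    }

    int rc = 0;
    int started = 0;
    for (int gpu = 0; gpu < n; ++gpu) {
        pid_t pid = fork();
        if (pid < 0) {              /* fork failed: stop starting workers */
            rc = -1;
            break;
        }
        if (pid == 0) {             /* child: private copy of all globals */
            shared[gpu] = do_gpu_work(gpu);
            _exit(0);
        }
        pids[started++] = pid;
    }

    /* Parent: wait for every worker that was actually started. */
    for (int i = 0; i < started; ++i) {
        int status = 0;
        if (waitpid(pids[i], &status, 0) < 0 ||
            !WIFEXITED(status) || WEXITSTATUS(status) != 0)
            rc = -1;
    }

    memcpy(results, shared, bytes);
    free(pids);
    munmap(shared, bytes);
    return rc;
}
```

Calling run_workers(4, out) with an int out[4] fills one slot per worker. The only synchronization here is the parent waiting for the children to exit; workers that need to coordinate while running would need something extra, such as a process-shared semaphore placed in the same mapping.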