When declaring a global constant array, where does that array reside? I need the same constant data to be present on two gpus, but I am not sure how to accomplish this (using the runtime api).
I would think that an ideal situation would be that delcaring the global constant array would allocate space on both cards, and with the device set correctly for multiple threads, a call to cudaMemcpyToSymbol within each thread would load the data for each. Is this how the system works?