First of all, I’m using CUDA 2.3 on Windows7 (VS2008).
I have two questions:
- Is there a way to check how many registers are used to execute a kernel?
- I store read only data in textures. What is the most efficient way to store data on the device?
Should I use cudaArrays for my data or is it enough to allocate some memory (device float *d_dataPtr)?