I am trying to transpose a 3D data set using CUDA. The CUDA code samples show how to transpose a 2D matrix, but if anyone can give any tips on how to expand that program or to use it so I can transpose a 3D matrix along a given dimension (x, y or z).
Basically this is part of a Poisson solver that I am designing as part of my graduate research.
This is extremely useful. Thank you so much. I’ve been trying to figure out why you are calculating all the coarse grid and the k1, k2 numbers. Any help on that front would be really appreciated.
I’m having some trouble running this function. Can you please post a sample test file that uses this function correctly? It doesn’t have to be overly complicated, just a simple main function with the necessary library includes and a correct call to the transpose kernel.