I’ve run into some problems dealing with several pointers in a kernel parameter list. It might boil down to my misunderstanding of pointers in CUDA.
Are pointers sometimes/always 64-bit, on both 32-bit and 64-bit hosts?
The matrix multiply driver API example uses cuParamSeti (32-bit integer) and separates pointers by only 4 bytes. However, other posts on this forum say that the separation must be 8 bytes and this matches some of my past experience. In which case, what happens to the other 4 bytes that cuParamSeti doesn’t touch?