I just started developing in CUDA, and my first test programs worked fine.
I have a 2D array of dimensions [n,t] in memory, and for every n, I want to apply a function to the corresponding column vector. In my first tests, I just summed all the elements of the vector. Now I want to call a function from a library instead, passing the vector to it.
Now the question is: do I have to compile the whole library with nvcc (and, even worse, add __device__ qualifiers to each function), or can I just include the header and call the library functions directly from the kernel without recompiling them?
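To make the question concrete, here is a minimal sketch of the "recompile and add qualifiers" route I would like to avoid. Everything in it is an assumption for illustration: vector_sum is a made-up stand-in for a library routine (not a function from the real library), HOST_DEVICE is a hypothetical macro name, and I assume element (i, k) of the [n,t] array is stored at data[k * n + i]. The macro expands to nothing under a plain C++ compiler, so the same source would still build with gcc.

```cpp
#include <cstddef>

// Hypothetical macro: adds the CUDA qualifiers only when compiled by nvcc.
#ifdef __CUDACC__
#define HOST_DEVICE __host__ __device__
#else
#define HOST_DEVICE
#endif

// Made-up stand-in for a library routine: sums len elements of v,
// stepping stride elements between consecutive entries.
HOST_DEVICE inline float vector_sum(const float* v, std::size_t len, std::size_t stride)
{
    float acc = 0.0f;
    for (std::size_t k = 0; k < len; ++k) {
        acc += v[k * stride];
    }
    return acc;
}

#ifdef __CUDACC__
// One thread per column i (0 <= i < n); each thread walks the t entries
// of its column. Assumes element (i, k) lives at data[k * n + i].
__global__ void sum_columns(const float* data, float* out, std::size_t n, std::size_t t)
{
    std::size_t i = static_cast<std::size_t>(blockIdx.x) * blockDim.x + threadIdx.x;
    if (i < n) {
        out[i] = vector_sum(data + i, t, n);
    }
}
#endif
```

With this layout the kernel would be launched with at least n threads in total, for example sum_columns<<<(n + 255) / 256, 256>>>(d_data, d_out, n, t), where d_data and d_out are device pointers I allocated beforehand. What I am hoping to find out is whether a library that was already compiled with gcc can be used in place of vector_sum without touching its source like this.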
Oh, and I’m working on Ubuntu 8.04 with gcc 4.2, if that matters.