I have a question regarding the cudaMallocHost() function. How is this function implemented by Nvidia? Is it possible to reimplement it in my own C or C++ code?
My CUDA application is an add-on to a very large application. I can't link the CUDA libraries into the large application, but I still want the benefits of pinned (page-locked) memory. So the question is: is it possible to reimplement cudaMallocHost() in my own code so that the host application can allocate pinned memory itself? Then I could simply pass the pointer to this memory area to my CUDA add-on and benefit from the faster cudaMemcpy() transfers.
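To make the idea concrete, here is a rough sketch of what I imagine on the host-application side, assuming a POSIX system. The names `host_buffer`, `host_buffer_alloc` and `host_buffer_free` are made up for illustration and are not part of any library. As far as I understand, mlock() only keeps the pages resident and does not tell the CUDA driver anything, so this is probably not equivalent to cudaMallocHost(); my assumption is that the add-on would still have to call cudaHostRegister() on the pointer (available since CUDA 4.0) before the copies become faster.

```c
#ifndef _POSIX_C_SOURCE
#define _POSIX_C_SOURCE 200809L
#endif
#include <assert.h>
#include <stdlib.h>
#include <string.h>
#include <unistd.h>
#include <sys/mman.h>

/* Hypothetical helper type: a page-aligned host buffer with no CUDA dependency. */
typedef struct {
    void  *ptr;     /* page-aligned start address */
    size_t size;    /* requested size rounded up to a whole number of pages */
    int    locked;  /* 1 if mlock() succeeded, 0 otherwise */
} host_buffer;

/* Allocate a page-aligned buffer and try to lock it into RAM.
 * Returns 0 on success, -1 on failure. A failed mlock() (for example
 * because of RLIMIT_MEMLOCK) is not fatal; it is only recorded in `locked`. */
static int host_buffer_alloc(host_buffer *buf, size_t bytes)
{
    assert(buf != NULL);

    long page = sysconf(_SC_PAGESIZE);
    if (page <= 0 || bytes == 0)
        return -1;

    size_t psz     = (size_t)page;
    size_t rounded = (bytes + psz - 1) / psz * psz;

    void *p = NULL;
    if (posix_memalign(&p, psz, rounded) != 0)
        return -1;

    memset(p, 0, rounded);                    /* touch every page once */

    buf->ptr    = p;
    buf->size   = rounded;
    buf->locked = (mlock(p, rounded) == 0);   /* OS-level page lock only */
    return 0;
}

/* Unlock (if it was locked) and release the buffer. */
static void host_buffer_free(host_buffer *buf)
{
    assert(buf != NULL);
    if (buf->ptr == NULL)
        return;
    if (buf->locked)
        munlock(buf->ptr, buf->size);
    free(buf->ptr);
    buf->ptr    = NULL;
    buf->size   = 0;
    buf->locked = 0;
}
```

The large application would call host_buffer_alloc() and hand `ptr` and `size` to the add-on. The add-on, which does link against CUDA, would presumably call cudaHostRegister(ptr, size, cudaHostRegisterDefault) before using the buffer and cudaHostUnregister(ptr) before the host application frees it.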
Any suggestions or ideas?
Thanks in advance. Best regards,