Hi,
I’m trying to write a library (DLL) that contains some CUDA code and exports methods which invoke CUDA kernels. But when I call these exported methods from an exe, device memory allocation fails. The same code works fine when it is built directly into an exe (not exported from a DLL). What am I doing wrong?
Thanks