Cannot dynamically load a shared library containing both OpenACC and CUDA code

MatColgrove · April 12, 2022, 5:44pm

Hi Olli,

Same answer as your first question, compile the C++ code with “-gpu=nordc”. However this one’s a bit more tricky in that in order to call the CUDA device routine, you need RDC enabled so the device linker can resolve the symbol.

What I’d try is merging the cuda.cu file in with the directives.cpp file so the routine can be inlined instead of called thus removing the need for the link step. While not fully supported, nvc++ can compile CUDA code. However since nvc++ is a single pass compiler, it cannot not support “__CUDA_ARCH__”. Instead we’ve added a constexpr “if target(nv::target::is_device)” which can be evaluated at compile time to mimic the behavior.

Normally I’d write an example for you, but my home internet is currently down and I can’t get VPN to work through my tethered cell phone so can’t get to my systems. Once back, I’ll post a follow-up with an example. Though in the mean time, you can see an example if posted HERE.

-Mat

Topic		Replies	Views
Enabling OpenMP offload breaks OpenACC code nvc, nvc++ and nvfortran	6	1264	December 1, 2021
Regression with NVHPC 22.7 and OpenACC offload kernels nvc, nvc++ and nvfortran	3	397	October 4, 2022
OpenACC + CUDA implementation nvc, nvc++ and nvfortran	7	87	January 30, 2025
NVCC forces c++ compilation of .cu files CUDA Programming and Performance	11	25656	December 11, 2011
Error when running optimized code but runs fine with debug nvc, nvc++ and nvfortran	4	1368	August 30, 2022
Simple CUDA Wizard for Visual Studio 2005 CUDA Programming and Performance	100	174509	April 8, 2012
Shared library with openacc code and ccall only runs on hosts's gpu arch nvc, nvc++ and nvfortran	17	87	July 30, 2024
Dynamically loading an OpenACC-enabled shared library from an executable compiled with nvc++ does not work nvc, nvc++ and nvfortran	5	871	April 13, 2022
Ubuntu 20.04, GCC 9.3, Cuda Toolkit 11.3 - not a supported combination? CUDA Programming and Performance	11	8961	November 4, 2021
How to map private dynamic array to the GPU with OpenMP and nvc? nvc, nvc++ and nvfortran	20	127	January 31, 2025

Cannot dynamically load a shared library containing both OpenACC and CUDA code

Related topics