At runtime: "Fatal error: Registered function 'nvkernel_xyz_foo_16_' not found in the CUBIN, error 1"

rommel.quintanilla.c · December 20, 2023, 6:41pm

Hi, I got this very weird error on a program that is using OpenMP for offloading to GPU:

Fatal error: Registered function 'nvkernel_xyz_foo_16_' not found in the CUBIN, error 1

I must to say that the actual code is quite large and complex (dozens of GPU kernels) and every attempt to create a small reproducible code was unsuccessful so far.

I said that the error is “weird”, because most of the kernels are generated correctly (according to the compiler feedback: 123, Generating “nvkernel_xyz_foo_16” GPU kernel). However, just a few are missed. I can verify that from the binary generated through the cuobjdump utility.

As the error says, how is it possible that a kernel is generated, even registered but not found in the final binary?

MatColgrove · December 20, 2023, 7:47pm

I’ve only seen this once before when a user was missing a “declare target” directive around a routine declaration so the device version of routine didn’t get created. So it might not be the kernel itself, but rather a routine it’s calling, or possibly a global variable such as a Fortron module variable.

-Mat

rommel.quintanilla.c · December 21, 2023, 2:09pm

Thanks for answering, Mat. My bad, I forgot to mention that I’m offloading a Fortran application.

But, the problem still persists, I’ve double-checked the missing kernels, and they are pretty similar to the others that are included in the cubin file. So, there is nothing special about those omitted kernels.

I’m trying to create again a small reproducible example focusing on the module global variable, though.

MatColgrove · December 21, 2023, 4:53pm

That would be great. The symbol name, i.e. the “16_” suffix, implies that it’s a module variable that missing.

Note that I’m on vacation for the holiday break, so my not respond until next year.

Topic		Replies	Views
At runtime: “Fatal error: Registered function ‘nvkernel_xyz_foo_2_’ not found in the CUBIN, error 1” nvc, nvc++ and nvfortran	3	412	February 19, 2024
Fatal error: Registered function 'nvkernel_bc_omp__F1L2563_13_' not found in the CUBIN, error 1 nvc, nvc++ and nvfortran	2	548	March 5, 2021
Missing kernels in a .cubin file? Not all of my kernels are showing up in a .cubin file CUDA Programming and Performance	5	9239	February 5, 2009
OpenMP offloading with nvc++ compiler wont run on GPU nvc, nvc++ and nvfortran	1	785	August 14, 2023
OpenMP: cuModuleGetGlobal returned error 500 nvc, nvc++ and nvfortran	9	981	November 1, 2021
cuModuleGetGlobal_v2 not located CUDA Programming and Performance	1	2407	March 31, 2011
[driver api][ptx]: cuModuleGetFunction fails with CUDA_ERROR_NOT_FOUND ... but cuModuleGetGlobal wor CUDA Programming and Performance	8	7345	September 13, 2010
cuModuleGetFunction in CUDA 2.0 B2 bug Does not accept original function name CUDA Programming and Performance	2	1872	June 22, 2008
Pycuda gives 'cuModuleGetFunction named symbol not found error !!! CUDA Programming and Performance	0	2341	September 21, 2017
error F0004 : Unable to open MODULE file Legacy PGI Compilers	11	9966	June 5, 2012

At runtime: "Fatal error: Registered function 'nvkernel_xyz_foo_16_' not found in the CUBIN, error 1"

Related topics