OMP offloading crash with nvc

mathias.louboutin · November 11, 2022, 2:55pm

Hello

We have updated our compiler to the Nvidia HPC SDK version 22.9 and we ran into an issue with the offloading.

We have a minimal example there

That works fine with the SDK 22.7 but not the SDK 22.9.

Cheers

mfatica · November 11, 2022, 6:10pm

If you add -mp=gpu when linking main, it seems to work:

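#!/bin/bash
# Build the offloaded code as a shared library.
nvc -g -fPIC -gpu=pinned -mp=gpu -fast -shared omp_crash.c -lm -o omp_crash.so
# Build the driver that loads the library, also passing -mp=gpu.
nvc -O3 -g -mp=gpu -Wall main.c -ldl -o main
./main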

$ sh run.sh
Running foo()…
DONE!

fabio8 · November 11, 2022, 6:28pm

Hi, unfortunately we don’t have control over the process performing the dynamic loading. In our case, it’s Python :) so we can’t easily do that. I’m guessing we could maybe use LD_PRELOAD but it gets nightmarish… and I’m not even that sure

Context: this is for Devito (GitHub: devitocodes/devito), a code generation framework for automated finite difference computation.

mfatica · November 11, 2022, 6:56pm

There are three libraries needed.

LD_PRELOAD=libacchost.so:libaccdevaux.so:libaccdevice.so ./main
Running foo()…
DONE!
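
When the loading process is Python and setting LD_PRELOAD is awkward, one possible alternative is to load the same three libraries inside the process with RTLD_GLOBAL before loading omp_crash.so. The sketch below is untested against the 22.9 crash and only shows the mechanism; the helper name preload_global is hypothetical, and the library names are taken from the LD_PRELOAD line above.

```python
import ctypes

# Library names copied from the LD_PRELOAD line in this thread.
NVHPC_RUNTIME_LIBS = ["libacchost.so", "libaccdevaux.so", "libaccdevice.so"]


def preload_global(names):
    """Try to load each library with RTLD_GLOBAL.

    Returns (handles, failures): a dict of name -> CDLL handle for the
    libraries that loaded, and a list of (name, error message) for the rest.
    Keeping the handles referenced avoids any doubt about their lifetime.
    """
    handles, failures = {}, []
    for name in names:
        try:
            handles[name] = ctypes.CDLL(name, mode=ctypes.RTLD_GLOBAL)
        except OSError as exc:
            failures.append((name, str(exc)))
    return handles, failures


if __name__ == "__main__":
    handles, failures = preload_global(NVHPC_RUNTIME_LIBS)
    for name, message in failures:
        print(f"could not preload {name}: {message}")
    # Assumption: omp_crash.so would be loaded only after the preload step,
    # e.g. ctypes.CDLL("./omp_crash.so").foo()
```

The libraries must be findable by the dynamic loader (for example through LD_LIBRARY_PATH), otherwise they end up in the failures list.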

fabio8 · November 14, 2022, 7:53am

FWIW, this is a bit painful, for various reasons…

installation / configuration. Fine, we have Dockerfiles, but still…
maintenance. What if those libs change in the next release, or a new one is added?
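
One way to soften the maintenance concern might be to derive the preload list from the ldd output of the generated library instead of hard-coding it. This is a minimal sketch under assumptions: the function names are hypothetical, the "libacc" prefix is only a guess based on the three names in this thread, and the sample text is made up to illustrate the usual ldd line format.

```python
import re
import subprocess

# Matches ldd lines of the form "<name> => <path> (0x...)".
_LDD_LINE = re.compile(r"^\s*(\S+)\s+=>\s+(\S+)")


def libs_from_ldd(ldd_output, prefix="libacc"):
    """Return resolved paths of libraries whose name starts with prefix."""
    paths = []
    for line in ldd_output.splitlines():
        match = _LDD_LINE.match(line)
        if not match:
            continue
        name, path = match.groups()
        # "name => not found" lines yield the token "not"; skip them.
        if name.startswith(prefix) and path != "not":
            paths.append(path)
    return paths


def ld_preload_for(shared_object):
    """Run ldd on shared_object and build a colon-separated LD_PRELOAD value."""
    result = subprocess.run(
        ["ldd", shared_object], capture_output=True, text=True, check=True
    )
    return ":".join(libs_from_ldd(result.stdout))


# Hypothetical sample input, not real output from this thread.
SAMPLE = (
    "\tlibacchost.so => /opt/example/lib/libacchost.so (0x00007f0000000000)\n"
    "\tlibm.so.6 => /lib/x86_64-linux-gnu/libm.so.6 (0x00007f0000100000)\n"
    "\tlibaccdevice.so => not found\n"
)
```

A new libacc* dependency would then be picked up automatically, but the prefix filter itself remains an assumption that could break in a later release.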

mfatica · November 14, 2022, 5:09pm

I would suggest opening a formal bug report.
If you run ldd omp_crash.so, those libraries are listed, and if you run with
LD_DEBUG=libs ./main
you can see that there are errors coming from pthreads.

191258:	./main: error: symbol lookup error: undefined symbol: pthread_atfork (fatal)
191258:	./main: error: symbol lookup error: undefined symbol: __dyn_pthread_atfork (fatal)
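
For a bug report it can help to list the failing symbols compactly. The sketch below pulls the undefined symbol names out of LD_DEBUG text such as the two lines above; the function names are hypothetical, and the helper assumes the loader writes its LD_DEBUG output to stderr, which is the default when LD_DEBUG_OUTPUT is not set.

```python
import os
import re
import subprocess

_UNDEFINED = re.compile(r"undefined symbol:\s*(\S+)")


def undefined_symbols(ld_debug_text):
    """Return the undefined symbol names in order of first appearance."""
    seen = []
    for match in _UNDEFINED.finditer(ld_debug_text):
        symbol = match.group(1)
        if symbol not in seen:
            seen.append(symbol)
    return seen


def run_with_ld_debug(command):
    """Run command with LD_DEBUG=libs and return the captured stderr text."""
    env = dict(os.environ, LD_DEBUG="libs")
    result = subprocess.run(command, env=env, capture_output=True, text=True)
    return result.stderr


# The two error lines quoted in this thread.
THREAD_LINES = (
    "191258:\t./main: error: symbol lookup error: "
    "undefined symbol: pthread_atfork (fatal)\n"
    "191258:\t./main: error: symbol lookup error: "
    "undefined symbol: __dyn_pthread_atfork (fatal)\n"
)
```

For example, undefined_symbols(run_with_ld_debug(["./main"])) would summarize a live run, assuming ./main is the driver built earlier in the thread.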

fabio8 · November 15, 2022, 8:02am

How do I open a bug report?

rs277 · November 15, 2022, 8:12am

Here.

mfatica · November 29, 2022, 7:37pm

It looks like the shared library works as expected from Python (which you indicated was your real use case), at least in this simple example.

$ cat pydriver.py
from ctypes import cdll
lib = cdll.LoadLibrary('./omp_crash.so')
lib.foo()

$ python pydriver.py
Running foo()…
DONE!
