Compile application with openmp target pragma

yaronk3 · November 22, 2020, 8:38am

Hi,
I’m trying to run an application that checks if it runs on GPU or CPU.
#include <stdio.h>
#include <omp.h>

int main()
{
int runningOnGPU = 0;
/* Test if GPU is available using OpenMP4.5 */
#pragma omp target map(from:runningOnGPU)
{
if (omp_is_initial_device() == 0)
runningOnGPU = 1;
}
/*If still running on CPU, GPU must not be available */
if (runningOnGPU)
printf(“### Able to use the GPU! ### \n”);
else
printf(“### Unable to use the GPU, using CPU! ###\n”);

return 0;
}
My OS is Ubunto 18.04 LTS, VGA card - Quadro P4000, the driver is nvidia-driver-440-server and Cuda version 10.0.

I tried to follow those documents but none of them worked.
https://freecompilercamp.org/llvm-openmp-build/ - segmentation fault with this build command

clang -fopenmp -fopenmp-targets=nvptx64-nvidia-cuda ongpu.c

Home · clang-ykt/clang Wiki · GitHub - build error
Krister Walfridsson’s old blog: Building GCC with support for NVIDIA PTX offloading - compilation error

Where can I find a document with instructions on how to build a simple OpenMP application?

yaronk3 · November 22, 2020, 8:42am

I also tried this document
https://on-demand.gputechconf.com/gtc/2016/presentation/s6510-jeff-larkin-targeting-gpus-openmp.pdf - - compilation error

MatColgrove · November 23, 2020, 5:52pm

Hi yaronk3,

I’ll ping Jeff to see if he can help with the Clang issue since I only support the NV HPC Compilers.

As for the code, it does seem fine as it compiles and runs correctly with our pre-release of 20.11 (after I uncomment the section in the code). 20.11 will be released in a few weeks and contain our initial Beta implementation for OpenMP offload to GPUs.

% cat test.c
#include <stdio.h>
#include <omp.h>

int main()
{
int runningOnGPU = 0;
/* Test if GPU is available using OpenMP4.5 */
#pragma omp target map(from:runningOnGPU)
{
if (omp_is_initial_device() == 0)
runningOnGPU = 1;
}
/* If still running on CPU, GPU must not be available */
if (runningOnGPU)
printf("### Able to use the GPU! ### \n");
else
printf("### Unable to use the GPU, using CPU! ###\n");

return 0;
}
% nvc -mp test.c -V20.11; a.out
### Unable to use the GPU, using CPU! ###
% nvc -mp=gpu test.c -V20.11 ; a.out
### Able to use the GPU! ###

-Mat

jlarkinnv · November 23, 2020, 6:10pm

Hi @yaronk3. I just tried Mat’s corrected code with XL and Clang and was able to run and get the expected results. Can you please try it just to make sure the commenting wasn’t the problem? One issue I’ve hit with Clang before is that if I didn’t get the bytecode version of the OpenMP runtime, it failed to run. This gave an error about failing to offload when offload is mandatory.

I’ve recently began installing llvm/clang using Spack. If that’s an option for you, I’d suggest trying that. Something like spack install llvm@master+cuda cuda_arch=70, but please check the options carefully, because I’m working from memory.

yaronk3 · November 24, 2020, 7:42am

Hi Mat & Jeff,
The commenting isn’t the problem(I edited the question), When I compile C file with Clang and -fopenmp-targets=nvptx64-nvidia-cuda flag I got segmentation fault when running the application.
I’m new to working with GPU, Can you explain how you build Clang and what is XL?
Thanks for the help.

jlarkinnv · November 24, 2020, 6:05pm

The XL compiler is IBM’s compiler, so if you don’t have access to an IBM machine, that won’t help you. I used it as a sanity check to make sure that the program worked with multiple compilers.

I’ve had limited success building CLANG myself from sources, but I’ve had a lot of success doing it with spack (Getting Started — Spack 0.19.0.dev0 documentation). On my machine, which has a Volta-generation card, I built using the following command:

$ spack install llvm@master+cuda cuda_arch=70

After that build completed (it took quite a while) I was able to build your code with the command you listed above.

yaronk3 · November 25, 2020, 10:21am

I tried to use spcak but I still got a segmentation fault, When I use Cuda 11.1
“ykain@toronto:~$ /home/ykain/spack/opt/spack/linux-ubuntu18.04-skylake_avx512/gcc-7.5.0/llvm-master-56iodrir4cysgqwcpm6vjxfipk55bprj/bin/clang -fopenmp -fopenmp-targets=nvptx64-nvidia-cuda --cuda-path=/usr/local/cuda ongpu.c -o ongpu
clang-12: warning: Unknown CUDA version. cuda.h: CUDA_VERSION=11010. Assuming the latest supported version 10.1 [-Wunknown-cuda-version]
ykain@toronto:~$ ./ongpu
Segmentation fault (core dumped)”
When I use Cuda 10.1 I got this error -
“/home/ykain/spack/opt/spack/linux-ubuntu18.04-skylake_avx512/gcc-7.5.0/llvm-master-56iodrir4cysgqwcpm6vjxfipk55bprj/bin/clang -fopenmp -fopenmp-targets=nvptx64-nvidia-cuda --cuda-path=/usr/local/cuda ongpu.c
nvlink fatal : Input file ‘/home/ykain/spack/opt/spack/linux-ubuntu18.04-skylake_avx512/gcc-7.5.0/llvm-master-56iodrir4cysgqwcpm6vjxfipk55bprj/lib/libomptarget-nvptx.a:omptarget-nvptx_generated_cancel.cu.o’ newer than toolkit (111 vs 101)
clang-12: error: nvlink command failed with exit code 1 (use -v to see invocation)”

I added the lib directory to LD_LIBRARY_PATH environment argument.

jlarkinnv · November 30, 2020, 4:21pm

Hi. I’m sorry you’re still having troubles. At this point, we’re beyond my knowledge of CLANG. I would suggest reaching out to the LLVM support channels and see whether anyone there knows what’s going on. I can confirm that your program is working on my machine with CLANG/LLVM built via spack. The unknown cuda version message is a warning and doesn’t prevent successful building and running on my machine. Sorry I couldn’t help more.

Topic		Replies	Views
Nsight Compute does not detect kernel launches for OpenMP offloaded code Nsight Compute profiling	11	1553	February 28, 2023
How use openmp in .cu file ? CUDA Programming and Performance	9	31648	March 31, 2010
Nvc++ OpenACC runtime segfaults if Intel MKL (numpy) is already loaded nvc, nvc++ and nvfortran	8	1253	October 7, 2023
Creating a shared library that utilises OpenMP offloading NVHPC 22.5 nvc, nvc++ and nvfortran	5	727	June 23, 2022
Access to the generated code by OpenMP target pragmas nvc, nvc++ and nvfortran cuda	8	1222	March 1, 2023
Enabling OpenMP offload breaks OpenACC code nvc, nvc++ and nvfortran	6	1262	December 1, 2021
OpenMP doesn't work in a templated function CUDA Programming and Performance	4	2253	September 14, 2009
Compile error for OpenMP code with target offloading in nvhpc 20.11 nvc, nvc++ and nvfortran	3	1565	December 21, 2020
Unable to compile on login node nvc, nvc++ and nvfortran	13	599	May 2, 2024
Behaviour of OpenMP target maps with Fortran arrays nvc, nvc++ and nvfortran	12	65	February 11, 2025

Compile application with openmp target pragma

Related topics