Cannot compile OpenMP directives to offload to Nvidia GPU from Windows 10

david.tronchoni · June 12, 2022, 3:41pm

Hello,

I hope someone can help me. I am new to CUDA, though less new to OpenMP. I am trying to compile this file, kernel.cu:

void cuda_matricesSubstract(int pIntCols, int pIntRows, float* pMatrixA, float* pMatrixB, float* pMatrixC)
{
    #pragma omp parallel
    {
        int i, j;
        #pragma omp target teams distribute parallel for map(to: i, j, pIntRows, pIntCols, pMatrixA, pMatrixB) map(tofrom: pMatrixC)
        for (i = 0; i < pIntRows; i++)
            for (j = 0; j < pIntCols; j++)
            {
                int lIntPos = i * pIntCols + j;
                pMatrixC[lIntPos] = pMatrixA[lIntPos] - pMatrixB[lIntPos];
            }
    }
}

With CMD command:

nvcc -c kernel.cu -o kernel.o -Xcompiler " -openmp"

I get this error (I have loosely translated it from Spanish):

kernel.cu(26): error C3001: ‘target’: expected a name of an OpenMP directive

I have tried different approaches, but none worked.

I have:

CUDA v11.7
Windows 10
GPU NVIDIA GeForce GTX 1060.

I look forward to your help so that I can get deeper into CUDA and NVIDIA.

Thanks in advance and best regards.

achartiernv · June 13, 2022, 6:15pm

Moved to CUDA Setup forum

Robert_Crovella · June 14, 2022, 6:40pm

CUDA and OpenMP (including target offload) are mostly orthogonal: they don’t relate to each other.

nvcc is not the correct compiler to use, nor is the CUDA toolkit intended to support OpenMP target offload to a GPU.

You won’t be able to use the CUDA toolkit for what you are trying to do here.

If you have a Windows compiler that supports OpenMP target offload to a GPU, use that, and follow the instructions from the provider of that compiler.
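
The C3001 error in the original post is consistent with how nvcc works on Windows: it forwards host code to MSVC, whose -openmp switch implements OpenMP 2.0, while the target construct was only added in OpenMP 4.0. As a rough check, the standard _OPENMP macro reports the specification date a compiler implements. The sketch below relies only on that macro; the helper names openmp_spec_date, spec_has_target and report_openmp_support are made up for illustration.

```cpp
#include <cstdio>

// Hypothetical helper: the OpenMP specification date (yyyymm) reported by the
// compiler, or 0 when OpenMP support is not enabled for this translation unit.
long openmp_spec_date()
{
#ifdef _OPENMP
    return _OPENMP;
#else
    return 0;
#endif
}

// Hypothetical helper: the target construct first appeared in OpenMP 4.0,
// whose specification date is 201307.
bool spec_has_target(long spec_date)
{
    return spec_date >= 201307L;
}

// Prints what the current compiler reports about its OpenMP level.
void report_openmp_support()
{
    long date = openmp_spec_date();
    std::printf("_OPENMP = %ld, target directive %s\n", date,
                spec_has_target(date) ? "recognized" : "not recognized");
}
```

Calling report_openmp_support() from main shows what a given compiler and flag combination reports. A date of 201307 or later only means the directive syntax is recognized; whether GPU offload actually works still depends on the compiler and how it was built.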

david.tronchoni · June 16, 2022, 6:46am

Hello Robert, thank you for your answer. I will look for such a compiler. Do you know of one?

Thanks in advance.

Robert_Crovella · June 16, 2022, 4:01pm

For Windows? No. Google may help you locate something.

For Linux, you could try the NVIDIA HPC SDK. See here.

user117802 · July 20, 2022, 6:48pm

Hi Robert,

Is it on the roadmap for the HPC SDK to support OpenMP offloading to GPUs?

Should I assume that the performance of OpenMP offloading to NVIDIA GPUs is comparable to OpenACC’s?

Outside CUDA, what would be the best offload programming model in terms of performance? OpenACC?

OpenMP codes are very pervasive, so OpenMP offloading support is probably a lower-cost way to move existing OpenMP-ready codes to NVIDIA GPUs.

Cheers
Michael

Robert_Crovella · July 20, 2022, 9:05pm

For Linux, it’s already available. Please reread my previous comment. Also see here:

Use -mp=gpu to parallelize OpenMP regions for offload to an NVIDIA GPU.
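
For reference, here is a minimal sketch of how the routine from the original post might look when built on Linux with the HPC SDK and the -mp=gpu flag quoted above. It is not the original author's code: the function and parameter names are shortened for illustration, the enclosing host-side parallel region is dropped (every host thread would otherwise launch its own copy of the target region), and the pointers are mapped with explicit array sections so the runtime knows how many elements to copy. Scalars and loop indices are left out of the map clauses because they do not need to be mapped explicitly.

```cpp
// Element-wise C = A - B for row-major matrices stored in flat arrays.
// Assumed build line (file name is hypothetical): nvc++ -mp=gpu kernel.cpp
// Without OpenMP enabled, the pragma is ignored and the loop runs on the host.
void matricesSubtract(int cols, int rows,
                      const float* a, const float* b, float* c)
{
    const int n = rows * cols;  // total number of elements in each matrix

    // a and b are only read on the device; c is fully overwritten, so it
    // only needs to be copied back. collapse(2) exposes both loops as one
    // iteration space.
    #pragma omp target teams distribute parallel for collapse(2) \
        map(to: a[0:n], b[0:n]) map(from: c[0:n])
    for (int i = 0; i < rows; i++)
        for (int j = 0; j < cols; j++)
        {
            int pos = i * cols + j;
            c[pos] = a[pos] - b[pos];
        }
}
```

The result is the same whether the loop runs on the GPU or falls back to the host, which makes it easy to check the routine on a machine without offload support first.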

If you have questions about the HPC SDK and related compilers, I suggest asking those on the HPC Compilers forum.

user117802 · August 25, 2022, 9:05pm

Thanks Robert!
