Nvidia HPC SDK, Device Offloading and Exclusive/Inclusive Scan

mfeuling01 · June 14, 2023, 9:16pm

Does the NVIDIA OpenMP device offloading capability support the exclusive/inclusive scan from OpenMP 5.0? It is described here: scan Directive.

I’d love to be able to use this together with #pragma omp loop. I have code that I want to switch easily between CPU and GPU targets. When targeting GPUs, nvc++ recognizes that the innermost loop of my program (which contains the exclusive scan) needs to be parallelized across threads; when targeting CPUs, it distributes parallelism across threads on the outer loop instead. In the CPU case, this lets the existing plain C++ serial exclusive scan in the innermost loop run correctly without race conditions, but in the GPU case I obviously don’t get the right answer. If I can’t use an OpenMP directive such as “scan” to keep an identical codebase for CPU and GPU execution, does anyone have suggestions for something else to explore?

Thanks,
Matt
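
For readers who have not used the directive, here is a minimal sketch of an exclusive prefix sum written with the OpenMP 5.0 scan directive. The function name `exclusive_scan_omp` and the `int` element type are illustrative assumptions, not taken from the poster's code. The sketch needs a compiler that implements `scan`; when OpenMP is not enabled, the pragmas are ignored and the loop runs as an ordinary serial scan.

```cpp
#include <vector>

// Exclusive prefix sum: out[i] = in[0] + ... + in[i-1], with out[0] = 0.
// Hypothetical helper for illustration only.
std::vector<int> exclusive_scan_omp(const std::vector<int>& in) {
    std::vector<int> out(in.size());
    const long n = static_cast<long>(in.size());
    int running = 0;

    // The inscan modifier marks `running` as the scan variable.
    #pragma omp parallel for reduction(inscan, + : running)
    for (long i = 0; i < n; ++i) {
        // Scan phase: for an exclusive scan, the running value is read
        // before the directive, so it excludes in[i].
        out[i] = running;
        #pragma omp scan exclusive(running)
        // Input phase: this iteration's contribution is added afterwards.
        running += in[i];
    }
    return out;
}
```

For an inclusive scan the two phases swap: the update goes before `#pragma omp scan inclusive(running)` and the read goes after it.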

MatColgrove · June 20, 2023, 4:39pm

Hi Matt,

Sorry for the late response. Our offices were closed for a U.S. holiday and I needed to double-check with engineering.

Scan isn’t something we support yet, and given that engineering is focused on bug fixes and performance improvements, it may be a while before we add new features such as this.

-Mat
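
Since the scan directive is not available, one possible direction is a blocked exclusive scan that uses only plain parallel loops. This is a rough sketch, not something suggested in the thread: the function name `exclusive_scan_blocked`, the `block` parameter, and the `int` element type are all assumptions made for illustration. It is written with CPU-style `#pragma omp parallel for`; for offloading, the two parallel loops would presumably need target directives and explicit data mapping, which is not shown and has not been tested here.

```cpp
#include <algorithm>
#include <cstddef>
#include <vector>

// Exclusive prefix sum without the scan directive (hypothetical helper).
// Pass 1: per-block totals. Pass 2: serial scan of the totals.
// Pass 3: per-block scan starting from each block's offset.
std::vector<int> exclusive_scan_blocked(const std::vector<int>& in,
                                        std::size_t block = 1024) {
    const std::size_t n = in.size();
    std::vector<int> out(n);
    if (n == 0) return out;
    if (block == 0) block = 1;  // guard against a zero block size

    const long nblocks = static_cast<long>((n + block - 1) / block);
    std::vector<int> offset(static_cast<std::size_t>(nblocks));

    // Pass 1: blocks are independent, so this loop has no races.
    #pragma omp parallel for
    for (long b = 0; b < nblocks; ++b) {
        const std::size_t lo = static_cast<std::size_t>(b) * block;
        const std::size_t hi = std::min(n, lo + block);
        int sum = 0;
        for (std::size_t i = lo; i < hi; ++i) sum += in[i];
        offset[b] = sum;
    }

    // Pass 2: short serial exclusive scan over the block totals.
    int running = 0;
    for (long b = 0; b < nblocks; ++b) {
        const int total = offset[b];
        offset[b] = running;
        running += total;
    }

    // Pass 3: each block writes its own slice of the output.
    #pragma omp parallel for
    for (long b = 0; b < nblocks; ++b) {
        const std::size_t lo = static_cast<std::size_t>(b) * block;
        const std::size_t hi = std::min(n, lo + block);
        int acc = offset[b];
        for (std::size_t i = lo; i < hi; ++i) {
            out[i] = acc;
            acc += in[i];
        }
    }
    return out;
}
```

The trade-off is that the input is read twice, in exchange for loops whose iterations never depend on one another. Whether that is worthwhile for a short innermost loop depends on the problem size.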

mfeuling01 · June 20, 2023, 5:50pm

No problem. Bummer! Thanks for the info.

Thanks,
Matt

system · July 4, 2023, 5:51pm

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.
