Using thrust::cuda::par with thrust::cuda::par.on

Greetings,
I have been tasked to make a very old project heavily using thrust as non-blocking as possible, so I am throwing stream definitions left and right, however, at some point saw this with its own execution policy restricting to use a memory region.

thrust::transform_inclusive_scan(thrust::cuda::par(Allocator), input.begin(), input.end(), output.begin(), scanStencil(), thrust::plus<int>());

Is there a way to combine thrust::cuda::par.on(myFooStream) with thrust::cuda::par(Allocator) in a simple manner without writing my own execution policy backend?

Best,
(This is a duplicate of https://devtalk.nvidia.com/default/topic/1061320/gpu-accelerated-libraries/using-thrust-cuda-par-with-thrust-cuda-par-on/)

Does this work?

thrust::cuda::par(allocator).on(stream)

Note, however, that even in the latest Thrust version, Thrust calls still block with respect to the host even when streams are used. To make Thrust calls non-blocking on the host, you need to use the new asynchronous API: https://docs.nvidia.com/cuda/cuda-toolkit-release-notes/index.html#thrust-release-notes
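For reference, here is a minimal sketch of how the combined policy might look in the original call. The identifiers `myFooStream` and `scanStencil` come from the question; the allocator type and the rest of the setup are assumptions for illustration:

```cpp
#include <thrust/device_vector.h>
#include <thrust/device_allocator.h>
#include <thrust/execution_policy.h>
#include <thrust/functional.h>
#include <thrust/transform_scan.h>
#include <cuda_runtime.h>

// Stand-in for the scanStencil functor from the question.
struct scanStencil {
    __host__ __device__ int operator()(int x) const { return x; }
};

int main() {
    cudaStream_t myFooStream;
    cudaStreamCreate(&myFooStream);

    thrust::device_vector<int> input(1000, 1);
    thrust::device_vector<int> output(1000);

    // Assumed allocator for Thrust's temporary storage; replace with
    // the project's own Allocator instance.
    thrust::device_allocator<char> alloc;

    // par(alloc).on(stream): temporary storage is drawn from alloc,
    // and the kernels are launched on myFooStream.
    thrust::transform_inclusive_scan(
        thrust::cuda::par(alloc).on(myFooStream),
        input.begin(), input.end(), output.begin(),
        scanStencil(), thrust::plus<int>());

    cudaStreamSynchronize(myFooStream);
    cudaStreamDestroy(myFooStream);
    return 0;
}
```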

Greetings striker159,
It works perfectly, thank you very much. I also appreciate your warning regarding the API.
Best,

Hi,
I've found the --default-stream per-thread nvcc option to be a very helpful feature from NVIDIA, exactly for the problem you're describing.
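For reference, the flag is passed to nvcc at compile time; a minimal sketch (the file name app.cu is a placeholder):

```shell
# With per-thread default streams, each host thread gets its own
# default stream that does not synchronize with other streams,
# so legacy code without explicit streams can still overlap work.
nvcc --default-stream per-thread -o app app.cu
```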

thanks
Eyal