Is there any way to tell CUDA to run host functions launched in different streams on multiple threads (such as a thread pool)?

SparkHu · December 2, 2022, 8:54am

It seems that all host functions launched with cudaLaunchHostFunc, even in different streams, are executed sequentially on a single thread.
I can't find any runtime API to configure CUDA to use a thread pool instead.
Is there any way to do this?

guilhermehartmann · July 11, 2023, 10:19am

Hello @SparkHu, any luck with this?

Robert_Crovella · July 11, 2023, 8:44pm

A related thread: cudaLaunchHostFunc API example
