Cudnn call with cuda streams

abishek · July 6, 2018, 10:38am

My aim is to run multiple ConvForward CUDNN calls parallely, all have the access to independent data and creation of different CUDNN handles is not a problem.

So far I have wrapped the whole of CUDNN calls under a class, and how do I pass the specific class object into different CUDA streams or is there any other smarter way to parallelize the ConvForward other than wrapping it up in a class and creating an object to the class and passing it into streams?

Any discussion leading to a solution is way much appreciated.

abishek · July 13, 2018, 12:42pm

Update: A quote from the documentation of the CUDNN states that the CUDNN function calls can be sort of parallelized using the CUDA streams and also using multiple host threads. Is there any documentation which could explain it.

Topic		Replies	Views
How to execute multiple cudnn-forward function concurrently cuDNN	0	384	May 25, 2019
cuDNN Stream Priority cuDNN cudnn	5	1010	May 17, 2021
CUDNN and multi-GPU parallelism GPU-Accelerated Libraries	1	2636	February 22, 2016
CUDA per-thread and cudnn behaviour CUDA Programming and Performance	1	1280	September 15, 2017
Does cudnnCreate() call create multiple streams internally? cuDNN cudnn	1	978	November 23, 2020
Cudnn_status_execution_failed cuDNN	1	701	January 20, 2021
Multiple CUFFT in different streams? CUDA Programming and Performance	7	7063	July 5, 2008
CUDA Streams CUDA Programming and Performance	2	3374	December 17, 2009
Threading and streams cudaStreamSynchronize CUDA Programming and Performance	4	3580	July 16, 2008
Persistent kernel + cuDNN cuDNN	1	1143	October 19, 2022

Cudnn call with cuda streams

Related topics