Multiple kernels running concurrently

virtual.ramblings · June 12, 2023, 10:49am

Hi,

I wanted to know what is the expected behavior on Orin AGX when two different CPU processes launch kernels to the same stream (default). Is it possible for these to run in parallel on the GPU (space sharing)?

From the profiling results that Nsight systems show, they seem to be running simultaneously.

However, as per my understanding, they are supposed to be run in a time-shared fashion. Could you please help me understand this better?

AastaLLL · June 13, 2023, 1:55am

Hi,

GPU resource for two processes is expected to be time-sharing.
Could you share a reproducible source so we can check this with our internal team?

Thanks.

virtual.ramblings · June 13, 2023, 12:12pm

Sure, thank you. The scripts we use are here: conc_folder_from_orin

One of them is a DNN training workload (MobileNetv3) and the other is an inference workload (ResNet50). Both of them are run using a wrapper script and profiled using Nsight Systems.

Instructions:

Download datasets (Was unable to upload these because of the size)
GLD23k : GitHub - cvdfoundation/google-landmark: Dataset with 5 million images depicting human-made and natural landmarks spanning 200 thousand classes.
ImageNet: ImageNet Object Localization Challenge | Kaggle
Replace dataset path in exp_script.sh
Run the script: bash exp_script.sh

AastaLLL · June 14, 2023, 2:36am

Hi,

Thanks for sharing the code.

We will try to reproduce this internally and check with our internal team for details.
Will let you know the following.

Thanks.

AastaLLL · June 14, 2023, 5:14am

Hi,

We also observe the similar behavior in our environment.
Need to check with our internal team. Will get back to you later.

Thanks.

virtual.ramblings · June 14, 2023, 5:29am

Thanks for trying it. By similar behavior, do you mean kernels from both processes running simultaneously?
Is there a way to confirm which SMs the kernels are running on using NSight?

virtual.ramblings · June 21, 2023, 9:53pm

Hi, any updates on this?

AastaLLL · June 26, 2023, 4:22am

Hi,

Sorry that we are still checking the details with our internal team.
Will share more info with you later.

Thanks.

AastaLLL · June 27, 2023, 7:16am

Hi,

All the active CUDA contexts use the GPU in a time-sharing manner.

Our guess is that the resolution of the NSight Tool is bigger than the resolution at which the time sharing is happening.
So the tool gives the impression that everything is running in parallel.

Thanks.

virtual.ramblings · June 30, 2023, 6:56am

Thanks for the clarification. Is there any detailed documentation on the GPU time sharing? For instance, how long is the default time slice for a process etc.

AastaLLL · July 3, 2023, 5:35am

Hi,

Sorry that GPU low-level info is not publicly shared.
Thanks

system · July 26, 2023, 2:58am

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Simultaneous execution of multiple kernels CUDA Programming and Performance	4	2638	December 24, 2008
Is it possible to execute two kernels concurrently? CUDA Programming and Performance	18	6764	July 2, 2010
Problem with multiple GPUs The multiple GPUs are not working in parallel CUDA Programming and Performance	6	1918	September 2, 2010
Concurrently kernels running on one device CUDA Programming and Performance	17	2885	March 2, 2010
Concurrent Kernel Execution on Fermi - confussion CUDA Programming and Performance	13	1744	October 10, 2011
Multiple kernels in flight? CUDA Programming and Performance	19	26994	August 28, 2007
the possibility of two CUDA program run in a GPU CUDA Programming and Performance	1	909	November 29, 2011
Concurrent kernel execution without stream CUDA Programming and Performance	7	2525	December 28, 2016
Behaviour in running two programs on single GPU(Tesla K40m)? CUDA Programming and Performance	2	737	December 5, 2014
GPU and CPU don't run in (pure) parallel ? CUDA Programming and Performance	24	20314	May 4, 2007

Multiple kernels running concurrently

Related topics